Monday, 29 October 2007

Chapter 3, Lesson 2: Encoding, decoding.

When is manual control of encoding required:
  • When interoperating with legacy/UNIX sys.
  • r/w files in other languages.
  • ASCII - 0-127 - 7-bit byte.
  • for languagues other than English - 128-255 - 8-bit byte.
  • code pages - ASCII for 0-127, and language specific values for 128-255.
  • code page - a list of character codes (code points) in a certain order.
  • code pages - support specific languages.
  • Windows code pages - 256 code points + zero-based.
  • Unicode - a massive code page with 10000+ characters
  • stds for encoding Unicode - UTF-16, UTF-8,...
  • overloaded Stream constructor - new StreamWriter("abc.txt", false, Encoding.Unicode);

No comments: