FANDOM


convmvEdit

iconvEdit

  • command and library to convert character encoding
  • library

LocaleEdit

LiteralEdit

ASCIIEdit

  • Most basic character encoding.
  • Most of the modern character encoding is based on this.
  • 0x00 to 0x7F

JISEdit

LatinEdit

WindowsEdit

UnicodeEdit

  • concept
  • Normalization
    • use to compare unicode string
    • has 4 type
    • utf8proc
      • C library for utf-8 normalization.
      • supports all 4 normalization
      • returned string is allocated with malloc() so needs to be deallocated with free()
      • has ruby, posgresql binding

wchar_tEdit

EncodingEdit

UTF-8Edit

UTF-16Edit

  • Mostly used in Windows.
  • needs to decode to use as Unicode so use UTF-32 as possible.

UTF-32Edit

  • uint32_t array of unicode.
  • good for internal use but not good for exchanging.
  • endian matters.

Unicode IteratorEdit

  • Used to convert unicode encoding.
  • Most of the time it needs to be converted to UTF-32 first.
  • Older API(before 1.48.0) doesn't have range check so use the header that has range check.
  • test code
  • source code

Ad blocker interference detected!


Wikia is a free-to-use site that makes money from advertising. We have a modified experience for viewers using ad blockers

Wikia is not accessible if you’ve made further modifications. Remove the custom ad blocker rule(s) and the page will load as expected.