
UTF-8 is an efficient encoding of Unicode character strings that recognizes the fact that the majority of text-based communications are in ASCII. It therefore optimizes the encoding of these characters: each ASCII character is encoded as a single byte.

Unicode is preferred to ASCII because it permits the inclusion of accents, scientific symbols and characters used in languages other than English. The UTF-8 format is a standard encoding that provides the most efficient means of encoding Unicode characters in cases where the majority of characters are in the ASCII range. Both UTF-8 and the alternative UTF-16 encoding are supported by all widely used operating systems and major applications (and have been for more than 15 years).
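The size difference can be illustrated with a short sketch. The sample strings below are illustrative only and are not taken from SNOMED CT content; `encoded_sizes` is a hypothetical helper name introduced for this example.

```python
# Compare how many bytes the same text occupies in UTF-8 and UTF-16.
# Non-ASCII characters are written as escapes so the example does not
# depend on how this page itself is stored.
samples = ["Fracture", "M\u00e9ni\u00e8re", "\u03b2-blocker"]


def encoded_sizes(text: str) -> tuple[int, int]:
    """Return (UTF-8 byte count, UTF-16 byte count) for text.

    "utf-16-le" is used so that no byte order mark is added to the count.
    """
    return len(text.encode("utf-8")), len(text.encode("utf-16-le"))


for sample in samples:
    utf8_len, utf16_len = encoded_sizes(sample)
    print(f"{sample!r}: UTF-8={utf8_len} bytes, UTF-16={utf16_len} bytes")
```

For text that is mostly ASCII, the UTF-8 form is roughly half the size of the UTF-16 form, because only the accented or non-Latin characters need more than one byte.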

SNOMED CT uses the UTF-8 representation (see footnote 1) of characters in terms and other text fields.


Footnotes
Ref Notes
1 Note that SNOMED CT does not use, or require use of, the Byte Order Mark (BOM) specified by the Unicode standard, because all SNOMED CT release files use UTF-8, a byte-oriented encoding with no byte-order ambiguity.
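Although release files are not expected to contain a BOM, some editors add one when a file is re-saved. A minimal sketch of a reader that tolerates this is shown below; the function names are hypothetical, and the assumption that a consumer would want to strip a stray BOM rather than reject the file is not stated in this guide.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM, if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


def decode_release_bytes(data: bytes) -> str:
    """Decode bytes as UTF-8, ignoring a stray leading BOM (assumed policy)."""
    return strip_utf8_bom(data).decode("utf-8")
```

Python's built-in `utf-8-sig` codec has the same effect when decoding, so `data.decode("utf-8-sig")` is an alternative to the explicit check.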

