|
Details
|
|
Fundamentally, computers only understand numbers. They encode letters and other characters in the form of numbers. Unicode specifies a unique number for each character, whatever the platform, software or language.
The incorporation of Unicode in Internet site applications enables the simultaneous satisfaction without modification of the requests of several platforms, languages and countries. Unicode also allows different software programs to exchange characters without loss of data.
The Unicode standard defines three forms of encoding which allow the transformation of the same data according to different formats (8 bits, 16 bits, 32 bits). For example, the UTF-8 format (Unicode Transformation Format 8 bits) is very well known for its use in HTML and all other related protocols.
Unicode encoding is characterized by %xx (x corresponding to a hexadecimal digit, therefore only 0123456789ABCDEF).
|