Unicode assigns each character a unique number, called a code point, between U+0000 and U+10FFFF. A code point can be stored using one of several encoding forms built on 8-bit, 16-bit, or 32-bit units (UTF-8, UTF-16, and UTF-32). Numbers, mathematical notation, popular symbols, and characters from most of the world's writing systems are each assigned a code point. For example, U+0041 is the English letter "A." Below is an example of how "Computer Hope" would be written as Unicode code points.
U+0043 U+006F U+006D U+0070 U+0075 U+0074 U+0065 U+0072 U+0020 U+0048 U+006F U+0070 U+0065
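The code point notation above can be reproduced programmatically. The following is a minimal Python sketch, not a definitive implementation; the helper names `to_code_points` and `from_code_points` are made up for this illustration, and it relies only on the built-in `ord()` and `chr()` functions.

```python
def to_code_points(text: str) -> str:
    """Return the U+XXXX notation for each character, separated by spaces."""
    # ord() gives the code point as an integer; 04X pads it to at least
    # four uppercase hexadecimal digits.
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


def from_code_points(notation: str) -> str:
    """Turn a space-separated list of U+XXXX values back into a string."""
    # Strip the leading "U+" from each token, parse the rest as hexadecimal,
    # and convert the resulting number back to a character with chr().
    return "".join(chr(int(token[2:], 16)) for token in notation.split())


print(to_code_points("Hope"))
# U+0048 U+006F U+0070 U+0065

print(from_code_points("U+0043 U+006F U+006D U+0070 U+0075 U+0074 U+0065 U+0072"))
# Computer
```

Because each code point is just a number, the conversion works in both directions without any lookup table.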
A common Unicode encoding is UTF-8, which uses 8-bit code units and represents each code point with one to four bytes. It's widely used in Linux environments to encode non-ASCII characters so they display properly when output to a text file.
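To illustrate how UTF-8 turns code points into 8-bit bytes, here is a short Python sketch. The helper name `utf8_hex` and the sample characters are assumptions chosen for this example; the sketch uses the standard `str.encode` method and requires Python 3.8 or later for the separator argument of `bytes.hex`.

```python
def utf8_hex(ch: str) -> str:
    """Return the UTF-8 bytes of a string as space-separated hex pairs."""
    return ch.encode("utf-8").hex(" ")


# Escapes are used so the example does not depend on the file's own encoding.
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]

for ch in samples:
    encoded = ch.encode("utf-8")
    print(f"U+{ord(ch):04X} takes {len(encoded)} byte(s): {utf8_hex(ch)}")

# U+0041 takes 1 byte(s): 41
# U+00E9 takes 2 byte(s): c3 a9
# U+20AC takes 3 byte(s): e2 82 ac
# U+1F600 takes 4 byte(s): f0 9f 98 80
```

ASCII characters such as "A" occupy a single byte, while other characters take two to four bytes. When writing a text file in Python, passing `encoding="utf-8"` to `open()` applies the same encoding to everything written.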