Unicode is a standard of computer character sets that aims to unambiguously represent every known glyph in every human language. Unicode's native encoding is 32 bit (older versions use 16 bits). Research Unicode