Scuba Docs


Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The count of supported scripts grows with each release; as of version 10.0, Unicode contains a repertoire of 139 modern and historic scripts, as well as multiple symbol sets.

The standard is implemented in modern operating systems, XML, Java, and other programming languages. The Unicode standard defines three encoding forms: UTF-8, UTF-16, and UTF-32.
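As a rough illustration of how the three encoding forms differ, the sketch below encodes the same text with each one using Python's built-in codecs. The helper name `encode_all` and the sample string are hypothetical, chosen only for this example. The little-endian codec variants are used so that no byte order mark is prepended to the output.

```python
def encode_all(text):
    """Encode text in the three Unicode encoding forms.

    Hypothetical helper for illustration only. The "-le" (little-endian)
    codecs are used so the results contain no byte order mark.
    """
    return {
        "utf-8": text.encode("utf-8"),
        "utf-16": text.encode("utf-16-le"),
        "utf-32": text.encode("utf-32-le"),
    }


if __name__ == "__main__":
    sample = "a\u00e9\u20ac\U0001F600"  # a, e-acute, euro sign, grinning face emoji
    for name, data in encode_all(sample).items():
        # Print the encoding name, the byte count, and the bytes in hex.
        print(name, len(data), data.hex(" "))
```

UTF-8 uses one to four bytes per code point, UTF-16 uses two or four, and UTF-32 always uses four, so the byte counts printed for the same text will usually differ between the three forms.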

Related terms

  • Character set
  • Character set encoding