Unicode

From WikiMD's Food, Medicine & Wellness Encyclopedia

Unicode sample.png
Hiero O4.png
Cyrillic cursive.svg
I acute - soft dotted and Lithuanian dot.svg

Template:Infobox character encoding

Unicode is a computing industry standard designed to ensure that text and symbols from all the world's writing systems are consistently encoded, represented, and handled by computers. The standard is maintained by the Unicode Consortium, a non-profit organization.

History[edit | edit source]

The development of Unicode began in 1987, with the first version of the Unicode Standard being published in 1991. The goal was to address the limitations of earlier character encoding systems, such as ASCII and various national and vendor-specific encodings, which were insufficient for representing the wide array of characters used in global languages.

Design Principles[edit | edit source]

Unicode is based on several key design principles:

  • **Universal Character Set**: Unicode aims to include every character used in writing systems across the world.
  • **Efficiency**: Unicode is designed to be efficient in terms of storage and processing.
  • **Unification**: Similar characters from different writing systems are unified into a single code point where possible.

Encoding Forms[edit | edit source]

Unicode can be implemented in different encoding forms:

  • UTF-8: A variable-width encoding that uses one to four bytes for each character.
  • UTF-16: A variable-width encoding that uses two or four bytes for each character.
  • UTF-32: A fixed-width encoding that uses four bytes for each character.

Character Properties[edit | edit source]

Each Unicode character has a set of properties that define its behavior in text processing. These properties include:

  • **General Category**: Defines the character type (e.g., letter, digit, punctuation).
  • **Combining Class**: Used for characters that combine with others, such as diacritics.
  • **Bidirectional Class**: Determines how characters are displayed in bidirectional text.

Unicode Blocks[edit | edit source]

Unicode characters are grouped into blocks based on their script or usage. Examples include:

Applications[edit | edit source]

Unicode is widely used in various applications, including:

Unicode Consortium[edit | edit source]

The Unicode Consortium is responsible for the development and maintenance of the Unicode Standard. It collaborates with other standards organizations, such as the International Organization for Standardization (ISO), to ensure compatibility and interoperability.

See Also[edit | edit source]

References[edit | edit source]

External Links[edit | edit source]

Template:Character encoding-stub

Wiki.png

Navigation: Wellness - Encyclopedia - Health topics - Disease Index‏‎ - Drugs - World Directory - Gray's Anatomy - Keto diet - Recipes

Search WikiMD


Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro / Zepbound) available.
Advertise on WikiMD

WikiMD is not a substitute for professional medical advice. See full disclaimer.

Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.

Contributors: Prab R. Tumpati, MD