Unicode
Page Template:Mono/styles.css must have content model "Sanitized CSS" for TemplateStyles (current model is "wikitext").
File:Unicode logo.svg | |
Language(s) | Multiple languages |
---|---|
Standard | Unicode Standard |
Classification | Character encoding |
Other related encoding(s) | ISO/IEC 10646 |
Unicode is a computing industry standard designed to ensure that text and symbols from all the world's writing systems are consistently encoded, represented, and handled by computers. The standard is maintained by the Unicode Consortium, a non-profit organization.
History[edit | edit source]
The development of Unicode began in 1987, with the first version of the Unicode Standard being published in 1991. The goal was to address the limitations of earlier character encoding systems, such as ASCII and various national and vendor-specific encodings, which were insufficient for representing the wide array of characters used in global languages.
Design Principles[edit | edit source]
Unicode is based on several key design principles:
- **Universal Character Set**: Unicode aims to include every character used in writing systems across the world.
- **Efficiency**: Unicode is designed to be efficient in terms of storage and processing.
- **Unification**: Similar characters from different writing systems are unified into a single code point where possible.
Encoding Forms[edit | edit source]
Unicode can be implemented in different encoding forms:
- UTF-8: A variable-width encoding that uses one to four bytes for each character.
- UTF-16: A variable-width encoding that uses two or four bytes for each character.
- UTF-32: A fixed-width encoding that uses four bytes for each character.
Character Properties[edit | edit source]
Each Unicode character has a set of properties that define its behavior in text processing. These properties include:
- **General Category**: Defines the character type (e.g., letter, digit, punctuation).
- **Combining Class**: Used for characters that combine with others, such as diacritics.
- **Bidirectional Class**: Determines how characters are displayed in bidirectional text.
Unicode Blocks[edit | edit source]
Unicode characters are grouped into blocks based on their script or usage. Examples include:
- Basic Latin: U+0000 to U+007F
- Cyrillic: U+0400 to U+04FF
- CJK Unified Ideographs: U+4E00 to U+9FFF
Applications[edit | edit source]
Unicode is widely used in various applications, including:
- Operating systems: Most modern operating systems support Unicode.
- Web browsers: Unicode is essential for displaying web pages in multiple languages.
- Programming languages: Many programming languages, such as Python and Java, support Unicode.
Unicode Consortium[edit | edit source]
The Unicode Consortium is responsible for the development and maintenance of the Unicode Standard. It collaborates with other standards organizations, such as the International Organization for Standardization (ISO), to ensure compatibility and interoperability.
See Also[edit | edit source]
References[edit | edit source]
External Links[edit | edit source]
Search WikiMD
Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro / Zepbound) available.
Advertise on WikiMD
WikiMD's Wellness Encyclopedia |
Let Food Be Thy Medicine Medicine Thy Food - Hippocrates |
Translate this page: - East Asian
中文,
日本,
한국어,
South Asian
हिन्दी,
தமிழ்,
తెలుగు,
Urdu,
ಕನ್ನಡ,
Southeast Asian
Indonesian,
Vietnamese,
Thai,
မြန်မာဘာသာ,
বাংলা
European
español,
Deutsch,
français,
Greek,
português do Brasil,
polski,
română,
русский,
Nederlands,
norsk,
svenska,
suomi,
Italian
Middle Eastern & African
عربى,
Turkish,
Persian,
Hebrew,
Afrikaans,
isiZulu,
Kiswahili,
Other
Bulgarian,
Hungarian,
Czech,
Swedish,
മലയാളം,
मराठी,
ਪੰਜਾਬੀ,
ગુજરાતી,
Portuguese,
Ukrainian
WikiMD is not a substitute for professional medical advice. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.
Contributors: Prab R. Tumpati, MD