Unicode
Computing industry standard for consistent encoding, representation, and handling of text
Unicode is a computing industry standard designed to ensure that text and symbols from all the world's writing systems are consistently encoded, represented, and handled by computers. The standard is maintained by the Unicode Consortium, a non-profit organization.
History
Development of Unicode began in 1987, and the first version of the Unicode Standard was published in 1991. The goal was to address the limitations of earlier character encoding systems, such as ASCII and the many national and vendor-specific encodings, none of which could represent the full range of characters used in the world's languages.
Design Principles
Unicode is based on several key design principles:
- **Universal Character Set**: Unicode aims to include every character used in writing systems across the world.
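- **Efficiency**: Unicode text is designed to be simple to parse and process, with no escape sequences or shift states needed to interpret a character.
- **Unification**: Characters shared by several languages within a script are encoded once, as in the unification of Han ideographs used in Chinese, Japanese, and Korean. Look-alike characters from different scripts, such as Latin A and Cyrillic А, remain separate code points.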
Encoding Forms
Unicode can be implemented in different encoding forms:
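- UTF-8: A variable-width encoding that uses one to four bytes for each code point and is backward compatible with ASCII.
- UTF-16: A variable-width encoding that uses one or two 16-bit code units (two or four bytes) for each code point; code points above U+FFFF are represented as surrogate pairs.
- UTF-32: A fixed-width encoding that uses four bytes for each code point.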
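The size differences between the three encoding forms can be checked directly. The following is a minimal sketch in Python using only the built-in `str.encode` method; the helper name `encoded_lengths` and the sample characters are illustrative choices, not part of the Unicode Standard. The little-endian codec names (`utf-16-le`, `utf-32-le`) are used so that no byte order mark is added to the count.

```python
# Compare how many bytes one character occupies in each encoding form.
# The "-le" codec variants are used so no byte order mark is prepended.
ENCODINGS = ("utf-8", "utf-16-le", "utf-32-le")


def encoded_lengths(ch):
    """Return the byte length of ch in UTF-8, UTF-16, and UTF-32."""
    return tuple(len(ch.encode(enc)) for enc in ENCODINGS)


# Sample characters chosen to cover the 1-, 2-, 3-, and 4-byte UTF-8 cases.
for ch in ["A", "\u00e9", "\u20ac", "\U0001F600"]:
    utf8, utf16, utf32 = encoded_lengths(ch)
    print(f"U+{ord(ch):04X}: UTF-8={utf8} UTF-16={utf16} UTF-32={utf32}")
```

Only the last sample, U+1F600, lies above U+FFFF, so it is the only one that needs a surrogate pair (four bytes) in UTF-16.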
Character Properties
Each Unicode character has a set of properties that define its behavior in text processing. These properties include:
- **General Category**: Defines the character type (e.g., letter, digit, punctuation).
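- **Canonical Combining Class**: A numeric value that determines how combining marks, such as diacritics, are ordered relative to one another during normalization; characters that do not combine have class 0.
- **Bidirectional Class**: Determines how characters are ordered for display in bidirectional text, such as text that mixes Hebrew or Arabic with Latin script.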
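These three properties can be inspected with Python's standard `unicodedata` module. The sketch below is one way to look them up; the helper name `describe` and the sample characters are illustrative assumptions, and the values reported depend on the Unicode version bundled with the Python interpreter in use.

```python
import unicodedata


def describe(ch):
    """Return a few Unicode properties of a single character."""
    return {
        "name": unicodedata.name(ch, "<unnamed>"),
        "category": unicodedata.category(ch),    # General Category, e.g. "Lu"
        "combining": unicodedata.combining(ch),  # Canonical Combining Class
        "bidi": unicodedata.bidirectional(ch),   # Bidirectional Class
    }


# A Latin letter, a combining diacritic, a Hebrew letter, and a digit.
for ch in ["A", "\u0301", "\u05d0", "5"]:
    print(f"U+{ord(ch):04X}", describe(ch))
```

For example, the combining acute accent U+0301 reports a non-zero combining class, while the Hebrew letter alef U+05D0 reports the right-to-left bidirectional class `R`.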
Unicode Blocks
Unicode characters are grouped into blocks based on their script or usage. Examples include:
- Basic Latin: U+0000 to U+007F
- Cyrillic: U+0400 to U+04FF
- CJK Unified Ideographs: U+4E00 to U+9FFF
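Because each block is a contiguous range of code points, finding a character's block is a simple range check. Python's standard library does not expose block names, so the sketch below hard-codes only the three ranges listed above; the `BLOCKS` table and the `block_of` helper are hypothetical names used for illustration, and a complete lookup would need the full block list published with the standard.

```python
# Partial block table: only the three example ranges from the list above.
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x0400, 0x04FF, "Cyrillic"),
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
]


def block_of(ch):
    """Return the block name for ch, or None if it is not in the table."""
    code_point = ord(ch)
    for start, end, name in BLOCKS:
        if start <= code_point <= end:
            return name
    return None


for ch in ["A", "\u0416", "\u4e2d", "\u20ac"]:
    print(f"U+{ord(ch):04X}", block_of(ch))
```

The euro sign U+20AC returns `None` here only because its block is not in this partial table, not because it lacks a block.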
Applications
Unicode is widely used in various applications, including:
- Operating systems: Most modern operating systems support Unicode.
- Web browsers: Unicode is essential for displaying web pages in multiple languages.
- Programming languages: Many programming languages, such as Python and Java, support Unicode.
Unicode Consortium
The Unicode Consortium is responsible for the development and maintenance of the Unicode Standard. It collaborates with other standards organizations, such as the International Organization for Standardization (ISO), to ensure compatibility and interoperability; in particular, the Unicode Standard is kept synchronized with ISO/IEC 10646.
Contributors: Prab R. Tumpati, MD