The Unicode Consortium

産業: Computer; Software

Number of terms: 11048

Number of blossaries: 0

Company Profile:

The Unicode Consortium or Unicode Inc. is a not-for-profit organization that coordinates the development of the Unicode standard. Its stated goal is to eventually enable computers to operate in all languages from around the world. The consortium develops and publishes a list of freely-available ...

その他

leading surrogate

A 16-bit code unit in the range D80016 to DBFF16, used in UTF-16 as the leading code unit of a surrogate pair.

Industry:Computer; Software

hiragana

One of two standard syllabaries associated with the Japanese writing system. Hiragana syllables are typically used in the representation of native Japanese words and grammatical particles.

Industry:Computer; Software

hypertext markup language (HTML)

A text description language related to SGML; it mixes text format markup with plain text content to describe formatted text. HTML is ubiquitous as the source language for Web pages on the Internet. Starting with HTML 4.0, the Unicode Standard functions as the reference character set for HTML content.

Industry:Computer; Software

Internet Assigned Numbers Authority (IANA)

The Internet Assigned Numbers Authority (IANA) is the entity that oversees global IP address allocation, autonomous system number allocation, root zone management in the Domain Name System (DNS), media types, and other Internet Protocol-related symbols and numbers. IANA is a department operated by the Internet Corporation for Assigned Names and Numbers, also known as ICANN.

Industry:Computer; Software

international components for unicode (ICU)

An Open Source set of C/C++ and Java libraries for Unicode and software internationalization support.

Industry:Computer; Software

ideograph

A technical term for a Chinese character. In the Unicode Standard, sinograms are systematically referred to instead as CJK ideographs or Han ideographs.

Industry:Computer; Software

ideographic property

Informative property of characters that are ideographs.

Industry:Computer; Software

iicore

A subset of common-use CJK unified ideographs, defined as the fixed collection 370 IICore in ISO/IEC 10646. This subset contains 9,810 ideographs and is intended for common use in East Asian contexts, particularly for small devices that cannot support the full range of CJK unified ideographs encoded in the Unicode Standard.

Industry:Computer; Software

ill-formed

A Unicode code unit sequence that purports to be in a Unicode encoding form is called ill-formed if and only if it does not follow the specification of that Unicode encoding form. * Any code unit sequence that would correspond to a code point outside the defined range of Unicode scalar values would, for example, be ill-formed. * UTF-8 has some strong constraints on the possible byte ranges for leading and trailing bytes. A violation of those constraints would produce a code unit sequence that could not be mapped to a Unicode scalar value, resulting in an ill-formed code unit sequence.

Industry:Computer; Software

ill-formed code unit sequence

A code unit sequence that does not follow the specification of a Unicode encoding form.

Industry:Computer; Software

用語

ソーシャル

エクストラ

ソリューション