The Unicode Consortium
Industry: Computer; Software
Number of terms: 11048
Number of blossaries: 0
Company Profile:
The Unicode Consortium or Unicode Inc. is a not-for-profit organization that coordinates the development of the Unicode standard. Its stated goal is to eventually enable computers to operate in all languages from around the world. The consortium develops and publishes a list of freely-available ...
A sequence of bytes that is used for code extension. The first byte in the sequence is escape (hex 1B).
Industry:Computer; Software
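As a rough illustration of the definition above, the sketch below encodes a short Japanese string with Python's built-in `iso2022_jp` codec and checks where the escape byte appears. ISO-2022-JP is chosen here only as a familiar example of code extension; the glossary entry itself does not name any particular encoding.

```python
# ESC (hex 1B) is the first byte of every escape sequence.
ESC = b"\x1b"

# Assumption: ISO-2022-JP serves as the example encoding that uses
# escape sequences to switch between character sets.
encoded = "日本語".encode("iso2022_jp")

# The codec switches to JIS X 0208 with ESC $ B and back to ASCII with ESC ( B.
starts_with_escape = encoded.startswith(ESC)
escape_count = encoded.count(ESC)

print(encoded)
print(starts_with_escape, escape_count)
```

Each escape sequence here is three bytes long, and in both of them the first byte is hex 1B.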
A character defined by an end user, using a private-use code point, to represent a character missing in a particular character encoding. These are common in East Asian implementations.
Industry:Computer; Software
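A minimal sketch of how such a character might be recognized, assuming Python's standard `unicodedata` module. The helper name `is_private_use` is hypothetical; it checks for the General_Category value `Co`, which Unicode assigns to private-use code points.

```python
import unicodedata


def is_private_use(ch: str) -> bool:
    """Return True if ch is a private-use code point (General_Category Co)."""
    return unicodedata.category(ch) == "Co"


# U+E000 is the first code point of the Private Use Area in the BMP.
eudc = "\ue000"

print(is_private_use(eudc))            # private-use code point
print(is_private_use("A"))             # ordinary assigned character
print(unicodedata.name(eudc, None))    # private-use code points have no name
```

Because the standard assigns no meaning to these code points, any interpretation of `eudc` depends on a private agreement between sender and receiver.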
Forms of decimal digits first used in Europe and now used worldwide. Historically, these digits were derived from the Arabic digits; they are sometimes called “Arabic numerals,” but this nomenclature leads to confusion with the real Arabic digits. Also called "Western digits" and "Latin digits."
Industry:Computer; Software
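The naming confusion described above can be seen by comparing a European digit with its counterpart used with the Arabic script. This is a small sketch using Python's standard `unicodedata` module; the variable names are illustrative only.

```python
import unicodedata

european = "3"            # U+0033
arabic_indic = "\u0663"   # U+0663, the digit three used with the Arabic script

# The two characters have different names and code points...
print(unicodedata.name(european))
print(unicodedata.name(arabic_indic))

# ...but the same numeric value.
print(unicodedata.digit(european), unicodedata.digit(arabic_indic))

# Python's int() accepts decimal digits from any script.
print(int("\u0663\u0664"))
```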
A canonical decomposition mapping from a character to a sequence of more than one character.
Industry:Computer; Software
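The following sketch shows one way to test whether a character has such a mapping, assuming Python's standard `unicodedata` module. The helper names `canonical_mapping` and `is_expanding` are hypothetical. The sketch only sees mappings listed explicitly in the character database, so algorithmically decomposed characters such as Hangul syllables are outside its scope.

```python
import unicodedata


def canonical_mapping(ch: str) -> list[str]:
    """Return the canonical decomposition mapping of ch, or [] if none is listed."""
    raw = unicodedata.decomposition(ch)
    # Compatibility mappings carry a tag such as "<compat>"; skip those.
    if not raw or raw.startswith("<"):
        return []
    return [chr(int(cp, 16)) for cp in raw.split()]


def is_expanding(ch: str) -> bool:
    """True if ch maps canonically to a sequence of more than one character."""
    return len(canonical_mapping(ch)) > 1


print(is_expanding("\u00e9"))   # é maps to U+0065 U+0301
print(is_expanding("\u212b"))   # ANGSTROM SIGN maps to the single character U+00C5
print(is_expanding("\ufb01"))   # fi ligature has only a compatibility mapping
```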
A value for an encoded character property that is explicitly associated with a code point in one of the data files of the Unicode Character Database.
Industry:Computer; Software
Any base character, or any standard Korean syllable block. This term is defined to take into account the fact that sequences of Korean conjoining jamo characters behave as if they were a single Hangul syllable character, so that the entire sequence of jamos constitutes a base.
Industry:Computer; Software
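One way to see why a jamo sequence is treated as a single base is to normalize it: under NFC the conjoining jamos compose into one precomposed Hangul syllable. The sketch below assumes Python's standard `unicodedata` module and uses an arbitrary example syllable.

```python
import unicodedata

# Leading consonant, vowel, and trailing consonant as conjoining jamo.
jamo = "\u1100\u1161\u11a8"

# NFC composes the sequence into one precomposed syllable character.
syllable = unicodedata.normalize("NFC", jamo)

print(len(jamo), len(syllable))
print(f"U+{ord(syllable):04X}")

# NFD reverses the composition and yields the jamo sequence again.
print(unicodedata.normalize("NFD", syllable) == jamo)
```

A combining mark that follows either form applies to the whole syllable, which is the behavior the definition of an extended base is meant to capture.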
A maximal character sequence consisting of either (a) an extended base followed by a sequence of one or more characters, each of which is a combining character, zero width joiner, or zero width non-joiner; or (b) a sequence of one or more characters, each of which is a combining character, zero width joiner, or zero width non-joiner.
Industry:Computer; Software
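A deliberately simplified sketch of this grouping, assuming Python's standard `unicodedata` module. The function names are hypothetical. The sketch treats every character that is not a combining mark, zero width joiner, or zero width non-joiner as a base on its own, so it does not group Korean conjoining jamo into a single extended base, and it also returns lone bases, which by the definition above are not combining character sequences.

```python
import unicodedata

ZWJ = "\u200d"    # ZERO WIDTH JOINER
ZWNJ = "\u200c"   # ZERO WIDTH NON-JOINER


def is_extender(ch: str) -> bool:
    """True for combining characters (General_Category M*), ZWJ, and ZWNJ."""
    return unicodedata.category(ch).startswith("M") or ch in (ZWJ, ZWNJ)


def split_sequences(text: str) -> list[str]:
    """Greedily attach each run of extenders to the character before it."""
    sequences: list[str] = []
    for ch in text:
        if is_extender(ch) and sequences:
            sequences[-1] += ch      # extend the current sequence
        else:
            sequences.append(ch)     # start a new sequence
    return sequences


print(split_sequences("e\u0301a"))          # base + mark, then a lone base
print(split_sequences("\u0301e"))           # leading mark has no base
print(split_sequences("a\u200d\u0308b"))    # ZWJ and mark both attach to "a"
```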
The text between extended grapheme cluster boundaries as specified by Unicode Standard Annex #29, “Unicode Text Segmentation.”
* Extended grapheme clusters are defined in a parallel manner to legacy grapheme clusters, but also include sequences of spacing marks.
* Grapheme clusters and extended grapheme clusters may not have any particular linguistic significance, but are used to break up a string of text into units for processing.
* Grapheme clusters and extended grapheme clusters may be adjusted for particular processing requirements, by tailoring the rules for grapheme cluster segmentation.
* The associated base character is the base character in the combining character sequence that a combining mark is part of.
* A combining mark in a defective combining character sequence has no associated base character and thus cannot be said to depend on any particular base character. This is one of the reasons why fallback processing is required for defective combining character sequences.
* Dependence concerns all combining marks, including spacing marks and combining marks that have no visible display.
Industry:Computer; Software
A subset of the range of numeric values for combining classes: specifically, any value in the range 10..199.
* Fixed position classes are assigned to a small number of Hebrew, Arabic, Syriac, Telugu, Thai, Lao, and Tibetan combining marks whose positions were conceived of as fixed with respect to their grapheme base, regardless of any other combining mark that might also apply to the grapheme base.
* Not all Arabic vowel points or Indic matras are given fixed position classes. The existence of fixed position classes in the standard is a historical artifact of an earlier stage in its development, prior to the formal standardization of the Unicode Normalization Forms.
Industry:Computer; Software
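The range test above can be sketched with Python's standard `unicodedata` module, whose `combining()` function returns a character's canonical combining class. The helper name `is_fixed_position_class` is hypothetical, and the sample characters are chosen only for illustration.

```python
import unicodedata


def is_fixed_position_class(ch: str) -> bool:
    """True if the canonical combining class of ch lies in the range 10..199."""
    return 10 <= unicodedata.combining(ch) <= 199


samples = {
    "\u05b0": "HEBREW POINT SHEVA",
    "\u064b": "ARABIC FATHATAN",
    "\u0e48": "THAI CHARACTER MAI EK",
    "\u0301": "COMBINING ACUTE ACCENT",
}

for ch, label in samples.items():
    ccc = unicodedata.combining(ch)
    print(f"{label}: ccc={ccc}, fixed position={is_fixed_position_class(ch)}")
```

The combining acute accent falls outside the range because its class describes a general position (above) rather than a fixed one.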