upload
The Unicode Consortium
産業: Computer; Software
Number of terms: 11048
Number of blossaries: 0
Company Profile:
The Unicode Consortium or Unicode Inc. is a not-for-profit organization that coordinates the development of the Unicode standard. Its stated goal is to eventually enable computers to operate in all languages from around the world. The consortium develops and publishes a list of freely-available ...
A character used as a substitute for an uninterpretable character from another encoding. The Unicode Standard uses U+FFFD REPLACEMENT CHARACTER for this function.
Industry:Computer; Software
A glyph used to render a character that cannot be rendered with the correct appearance in a particular font. It often is shown as an open or black rectangle.
Industry:Computer; Software
A glyph used to render a character that cannot be rendered with the correct appearance in a particular font. It often is shown as an open or black rectangle.
Industry:Computer; Software
Any code point of the Unicode Standard that is reserved for future assignment. Also known as an unassigned code point. * Surrogate code points and noncharacters are considered assigned code points, but not assigned characters. In general, a conforming process may indicate the presence of a code point whose use has not been designated (for example, by showing a missing glyph in rendering or by signaling an appropriate error in a streaming protocol), even though it is forbidden by the standard from interpreting that code point as an abstract character.
Industry:Computer; Software
row
A range of 256 contiguous Unicode code points, where the first code point is an integer multiple of 256. Two code points are in the same row if they share all but the last two hexadecimal digits.
Industry:Computer; Software
The Syriac Abbreviation Mark is a Unicode control character (U+070F) that forms part of the Syriac script block. In Syriac, words are sometimes written in an abbreviated form, omitting some of the last letters. In such cases, a special overline is drawn over some of the final letters of the abbreviated word. Another use of this overline is to mark numbers: in Syriac numbers are written using numerical values which are assigned to letters (similarly to the Gematria system in Hebrew). The sequence of letters used to write the number are also marked by the overline.
Industry:Computer; Software
Any one-byte character encoding. This term is generally used in contrast with DBCS and/or MBCS.
Industry:Computer; Software
A collection of letters and other written signs used to represent textual information in one or more writing systems. For example, Russian is written with a subset of the Cyrillic script; Ukranian is written with a different subset. The Japanese writing system uses several scripts.
Industry:Computer; Software
A writing style without spaces or punctuation.
Industry:Computer; Software
The Standard Compression Scheme for Unicode (SCSU) is a Unicode Standard for reducing the number of bytes needed to represent Unicode text, especially if that text uses mostly characters from one or a small number of per-language character blocks. It does so by dynamically mapping values in the range 128–255 to offsets within particular blocks of 128 characters. The initial conditions of the encoder mean that existing strings in ASCII and ISO-8859-1 that do not contain C0 control codes other than NULL TAB CR and LF can be treated as SCSU strings. Since most alphabets do reside in blocks of contiguous Unicode codepoints, texts that use small alphabets and either ASCII punctuation or punctuation that fits within the window for the main alphabet can be encoded at one byte per character, most other punctuation can be encoded at 2 bytes per symbol through non-locking shifts. SCSU can also switch to UTF-16 internally to handle non-alphabetic languages.
Industry:Computer; Software