Home Articles FAQs XREF Games Software Instant Books BBS About FOLDOC RFCs Feedback Sitemap
irt.Org

UCS transformation format


<standard, character> (UTF) A set of standard character encodings in accordance with ISO 10646.

The most widely used UTFs are UTF-8, UTF-16, and UTF-32. The code tables in ISO 10646 and in the Unicode standard are identical, although the Unicode standard includes additional material.
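As a rough illustration of how the three formats differ, the following Python sketch encodes the same character under each of them. The helper name encode_all is hypothetical and the big-endian codec variants are chosen only so that no byte order mark is added; it is a minimal sketch, not a reference implementation.

```python
def encode_all(text):
    """Encode text under UTF-8, UTF-16 and UTF-32.

    The "-be" (big-endian) codec names are used so that Python does not
    prepend a byte order mark to the UTF-16 and UTF-32 results.
    """
    codecs = ("utf-8", "utf-16-be", "utf-32-be")
    return {name: text.encode(name) for name in codecs}


if __name__ == "__main__":
    # U+20AC EURO SIGN: 3 bytes in UTF-8, 2 in UTF-16, 4 in UTF-32.
    for name, data in encode_all("\u20ac").items():
        print(name, len(data), "bytes:", data.hex())
```

The same code point always yields the same character when decoded; only the byte sequences differ between the formats.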

UTF-8 is the most widely used encoding, at least on Unix systems. Bytes like '\0' or '/', which have a special meaning in filenames and other C library function parameters, never appear in it as part of the encoding of another character, and 7-bit ASCII characters have the same encoding under both ASCII and UTF-8, so the required changes to existing software are minimised.
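The ASCII-compatibility property can be checked with a short Python sketch. The helper ascii_bytes_in and the sample filename are assumptions made up for this illustration; the comparison with UTF-16 is added only to show what goes wrong in an encoding that lacks the property.

```python
def ascii_bytes_in(text, encoding="utf-8"):
    """Return the bytes below 0x80 found in the encoded form of text."""
    return bytes(b for b in text.encode(encoding) if b < 0x80)


if __name__ == "__main__":
    # Hypothetical filename mixing ASCII with non-ASCII characters.
    name = "na\u00efve/\u65e5\u672c.txt"

    # In UTF-8 the only low bytes are the ASCII characters themselves,
    # so the single '/' is still the only path separator and no NUL appears.
    print(ascii_bytes_in(name))

    # U+012F encodes in big-endian UTF-16 as the bytes 0x01 0x2F, so a
    # byte-oriented C function would see a spurious '/'.
    print(ascii_bytes_in("\u012f", "utf-16-be"))
    print(ascii_bytes_in("\u012f", "utf-8"))
```

Because every byte of a multi-byte UTF-8 sequence is 0x80 or above, code that scans for '\0' or '/' byte by byte keeps working unchanged.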

Other UTFs, such as UTF-1 and UTF-7, are not widely used.

UTF-8 and Unicode FAQ for Unix/Linux (http://cl.cam.ac.uk/~mgk25/unicode.html#ucs).

(2002-01-15)



©2018 Martin Webb
