The Unicode Standard, Version 3.0
Description:
Detailed specifications for Unicode: structure, conformance encoding forms, character properties, semantics, equivalence, combining characters, logical ordering, conversion, allocation, big/little endian usage, Korean syllable formation, control characters, case mappings, numeric values, mathematical properties, writing directions (Arabic, Japanese, English, and so on), character shaping (Arabic, Devanagari, Tamil, and so on).
Expanded implementation guidelines by experts in global software design: normalization, sorting and searching, case mapping, compression, language tagging, boundaries (characters, words, lines, and sentences), rendering of non-spacing marks, transcoding to other character sets, handling unknown characters, surrogate pairs, numbers, editing and selection, keyboard input, and more.