tlhIngan-Hol Archive: Mon May 06 14:11:26 1996

Back to archive top level

To this year's listing



[Date Prev][Date Next][Thread Prev][Thread Next]

ANNOUNCEMENT: The ConScript Unicode Registry (CSUR)



This is to announce the forming of the ConScript Unicode Registry,
or CSUR for short.  This is a Web page for coordinating the assignment
of blocks out of the Unicode Private Use space (E000-F8FF and
000F0000-0010FFFF) to constructed/artificial scripts, including
scripts for constructed/artificial languages.

What is Unicode?  It's a 16-bit character code standard (also known
as ISO 10646) which is intended to provide character codes for, in
principle, all the world's written languages.  Currently, about
24 major scripts are handled, with more expected.

A range of 6400 characters has been reserved for "private use"; these
codes will never be given standard values, and can be used by anyone
for any purpose.  In addition, 131072 additional codes outside the
16-bit codespace, and encoded with two consecutive 16-bit codes
(from a reserved area) are also available for private use.  Between
these two areas, there are more than enough character slots (called
codepoints) available for all the constructed scripts ever thought of,
without stepping on each others' toes.

I am therefore volunteering to maintain a registry of scripts and
the codes assigned to them.  The relevant information on each script
will be available at the CSUR Web page at http://www.ccil.org/~cowan/csur,
currently a copy of this message.  I'll be adding information on
how to write a script registration document and adding it to the
above page over the next week or so.

Initial registration documents will reflect the following blocks:

	E000-E06F TENGWAR
	E070-E0BF CIRTH
	F8D0-F8FF KLINGON

The Tengwar block is compatible (but extends) the existing defined
Tengwar block; the Klingon block is compatible with Linux.

-- 
John Cowan						[email protected]
			e'osai ko sarji la lojban



Back to archive top level