The TEI Character Encoding Workgroup, chaired by Christian Wittern, began its work in 2003 and completed it in 2005.
Resources
Draft Documents for P5
[Replacement draft for TEI P5/CH](FASC-ch.pdf)
[Replacement draft for TEI P5/WD](FASC-wd.pdf)
Draft Papers
[CE01: Terms of Reference for the TEI
Workgroup on Character Encoding](ce01.xml)
[CE W 01: [DRAFT] Chapter 4: Languages and Character sets](cew01.xml)
- CE W 02: XSLT-based proof of concept for
solutions discussed at Tuebingen meeting
[CE W 03: A collection of use cases for
extensions to the basic character set of a document](cew03.xml)
[CE W 04: Language and script identification](cew04.html) ([Additional comments from PD](cew04-1.html))
[CE W 05: Semantics for characters and
linguistic features](cew05.xml)
[CE W 06: Extending the document character
set](cew06.xml)
[CE W 07: Private use characters in XML](cew07.xml)
[CE W 08: An analysis of topics in P4
chapter 4 and CE W 01](cew08.xml)
[CE W 09: Language identification](cew09.xml): draft for inclusion in P5/CH
[CE W 12: Report from Sanskrit Workgroup](cew12.pdf)
Meetings and Reports
[CE M 02 Minutes of Workgroup Meeting in Nancy, 05-06 Nov 2003](cem02.xml)
[CE R 01 Report to the TEI Members Meeting in Chicago, Oct 2002](cer01.xml)
[CE M 01 Minutes of Workgroup Meeting in Tuebingen, 23-24 Jul 2002](cem01.xml)
Background Documents and Links
[Design Of An Electronic Method For Describing Writing Systems](Design_of_an_Electronic.pdf) (Eric S. Albright's thesis)
[The Text in the Age
of Digital Reproduction](chibs-2002-paper.html)
(Draft paper by Christian Wittern)
- (TEI-C)
[P4: The XML Version of the TEI Guidelines](http://www.tei-c.org/P4X/)
- (W3C)
[Character Model for the World Wide Web 1.0](http://www.w3.org/TR/charmod/)
- (W3C, Unicode Consortium)
[Unicode in XML and other
Markup Languages](http://www.w3.org/TR/unicode-xml/)
- Jukka Korpela:
[A tutorial on character code issues](http://www.cs.tut.fi/~jkorpela/chars.html)
Some use cases
[Typographic Regularization in the WWP Textbase](http://www.nyu.edu/its/humanities/ach_allc2001/papers/russom/index.html)
A proposal for ACH/ALLC 2001
by Jacqueline H. Russom and Sydney D. Bauman
(Scholarly Technology Group, Brown University)
How to refer to characters/glyphs not in the document character set
- The SVG Specification uses an element
[altGlyph](http://www.w3.org/TR/SVG/text.html#AltGlyphElement)
to refer to variant glyphs
- MathML uses an element
[<mglyph>](http://www.w3.org/TR/MathML2/chapter3.html#presm_mglyph)
for "presentation glyphs".
- Unicode has specific and generic Variation Selectors
(U+FE00 to U+FE0F); see (Unicode Consortium)
[Standardized Variants](http://www.unicode.org/Public/UNIDATA/StandardizedVariants.html). Their use is also discussed in [Unicode in XML and other Markup Languages](http://www.w3.org/TR/unicode-xml/#Format), listed above.
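As a rough illustration of the variation selector mechanism mentioned in the last item, the following Python sketch detects and strips selectors in the U+FE00 to U+FE0F range. The helper names `is_variation_selector` and `strip_variation_selectors` are hypothetical and chosen only for this example; the sample sequence (U+2268 followed by U+FE00) is assumed to be one of the sequences listed in the Standardized Variants data.

```python
import unicodedata

# Range quoted in the list above: U+FE00 through U+FE0F.
VS_FIRST, VS_LAST = 0xFE00, 0xFE0F


def is_variation_selector(ch: str) -> bool:
    """Return True if ch is one of the sixteen selectors U+FE00..U+FE0F."""
    return VS_FIRST <= ord(ch) <= VS_LAST


def strip_variation_selectors(text: str) -> str:
    """Drop the selectors, leaving only the base characters."""
    return "".join(ch for ch in text if not is_variation_selector(ch))


# Base character U+2268 followed by VARIATION SELECTOR-1 (U+FE00).
base = "\u2268"
variant = base + "\uFE00"

# Show each code point of the sequence with its Unicode name.
for ch in variant:
    print(f"U+{ord(ch):04X}", unicodedata.name(ch), is_variation_selector(ch))
```

A processor that cannot render the requested variant can fall back to the base character by calling `strip_variation_selectors`, which is one reason the selector approach degrades more gracefully than a private use code point.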
Character semantics
- Unicode defines character semantics in the Unicode Character
Database (UCD), available at
[UnicodeData.txt](http://www.unicode.org/Public/UNIDATA/UnicodeData.txt);
its contents are explained in [Unicode Data File Format](http://www.unicode.org/Public/UNIDATA/UnicodeData.html).
See also: (Unicode Consortium, UTR Draft) [Unicode Technical Report #23
Character Properties](http://www.unicode.org/unicode/reports/tr23/index.html)
- (Unicode Consortium, TUS Annex 21)
[Case Mappings](http://www.unicode.org/unicode/reports/tr21/)
- (Unicode Consortium, UTR Draft)
[Unicode Technical Report #30
Character Foldings](http://www.unicode.org/unicode/reports/tr30/)
- (Unicode Consortium, TUS Annex 15)
[Unicode Normalization Forms](http://www.unicode.org/unicode/reports/tr15/)
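The properties, case mappings, and normalization forms referenced in this list can be inspected from most programming languages. The sketch below uses Python's standard `unicodedata` module, which ships its own copy of the UCD; the `describe` helper is a hypothetical name introduced for this example, and the characters chosen are arbitrary.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect a few UCD-derived properties for a single character."""
    return {
        "code_point": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch, "<unnamed>"),
        "category": unicodedata.category(ch),            # general category
        "decomposition": unicodedata.decomposition(ch),  # canonical or compatibility mapping
        "upper": ch.upper(),                             # full case mapping
        "casefold": ch.casefold(),                       # case folding for caseless matching
    }


e_acute = "\u00E9"   # precomposed e with acute accent
sharp_s = "\u00DF"   # German sharp s

print(describe(e_acute))
print(describe(sharp_s))

# Normalization: NFD splits the precomposed letter, NFC recombines it.
decomposed = unicodedata.normalize("NFD", e_acute)
recomposed = unicodedata.normalize("NFC", decomposed)
print([f"U+{ord(c):04X}" for c in decomposed], recomposed == e_acute)
```

The sharp s case shows why case mapping is not always one-to-one: its uppercase and case-folded forms are both two characters long.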