Report on TEI InternationalizationSebastian RahtzOctober 2006The Text Encoding Initiative
No source
TEI internationalisation project
The aim is to lower the barrier for entry of non-English-speaking users by:
Ensuring that all of the TEI is Unicode-safeTranslating the reference element and attribute descriptionsProviding tools to easily access non-English versionsLocalizing TEI softwareLocalizing the TEI examplesTranslating the running prose of the GuidelinesTranslating the object names
Reminder: definitions
Internationalization is the process of generalizing a product so
that it can handle multiple languages and cultural conventions without
the need for redesign. Internationalization takes place at the level
of program design and document development.Localization is the process of taking a product and
making it linguistically and culturally appropriate to a given
target locale (country/region and language) where it will be
used.
Examples of translation
instead of addrLine, the TEI user might
prefer to write l�-neaDirecci�"n, ligneAdresse,
linDireccio or AdressZeile. instead of supplies the descriptive and declarative
information making up an electronic title page prefixed to every
TEI-conformant text., the French-speaking user might find it more
helpful to read L'élément En-tête TEI TeiHeader fournit
les informations descriptives et déclaratives précédant chaque texte
conforme � la TEI et qui permettent de constituer une page de titre
électronique
Localisation of examples
What does this
Sire Thopas was a doghty swayn;White was his face as payndemayn,His lippes rede as rose;His rode is lyk scarlet in grayn,And I yow telle in good certayn,He hadde a semely nose.
mean to a Chinese scholar?
Example of translated ODD
TEI HeaderTEIヘ�*ーtei標� -En-tête TEIsupplies the descriptive and declarative information making
up an electronic title page prefixed to every TEI-conformant
text. TEI準�- テ�-�'トに付与さ�'�-�'電�-�版のタイトルページを
�-成す�-�'記述的・宣�*的情� �を含�*.在�*有符合TEI標準的文本起�-的電�-��'名� �當�-提供敘述�*�以及宣告�*�的�"�訊�*�L'élément En-tête TEI TeiHeader fournit les informations descriptives et déclaratives précédant chaque texte conforme � la TEI et qui permettent de constituer une page de titre électronique.specifies the kind of document to which the header is attached.当�"ヘ�*ー�'付与さ�'�-文書の種類を示す.Spécifie le type de document auquel l'en-tête se rapporte.textthe header is attached to a single text.L'en-tête se rapporte � un texte isolé.the header is attached to a corpus.L'en-tête se rapporte � un corpus.Shakespeare: the first folio (1623) in electronic formShakespeare, William (1564�*�1616)Originally prepared byTrevor Howard-HillRevised and edited byChristine Avern-CarrOxford Text Archive13 Banbury Road, Oxford OX2 6NN, UK119
Freely available on a non-commercial basis.
1968The first folio of Shakespeare, prepared by Charlton Hinman
(The Norton Facsimile, 1968)
Originally prepared for use in the production of a series of
old-spelling concordances in 1968, this text was extensively
checked and revised for use during the editing of the new Oxford
Shakespeare (Wells and Taylor, 1989).
Turned letters are silently corrected.
Original spelling and typography is retained, except
that long s and ligatured forms are not encoded.
A reference is created by assembling the following,
in the reverse order as that listed here:
the n value of the preceding lba periodthe n value of the ancestor div2a spacethe n value of the parent div1
12 Apr 89 Last checked by CAC1 Mar 89 LB made new fileShakespeare: the first folio (1623) sous forme électroniqueShakespeare, William (1564�*�1616)Préparé parTrevor Howard-HillRévisé et édité parChristine Avern-CarrOxford Text Archive13 Banbury Road, Oxford OX2 6NN, UK119
Disponible gratuitement � des fins non commerciales.
1968The first folio of Shakespeare, préparé par Charlton Hinman
(The Norton Facsimile, 1968)
Préparé pour la production d'une collection de concordances old-spelling en 1968, ce texte a été profondément relu et révisé pour l'édition du new Oxford
Shakespeare (Wells and Taylor, 1989).
Les caractères bloqués sont corrigés sans commentaire.
L'orthographe et la typographie originales sont conservées, � l'exception des s longs et des ligatures qui ne sont pas encodées.
Une référence est créée en assemblant les éléments suivants dans l'ordre inverse de la liste suivante :
la valeur de l'attribut n de l'élément lbprécédent.un pointla valeur de l'attribut n de l'élément div2 ancêtre.un espacela valeur de l'attribut n de l'élément div1 parent.
12 avril 1989 Dernière vérification par CAC1er mars 1989 Nouveau fichier par LB
One of the few elements unconditionally required in any
Un des rares éléments obligatoires dans tout document TEI.
The scale of work involved
494 elements116 classes476 attributes1115 gloss elements, 29357 characters1170 desc elements, 98415 characters
Who's up for this?
Volunteers for the following languages have stepped up:
ChineseDutchFrenchGermanHindiItalianJapanesePolishPortugueseRomanianSerbianSlovenianSpanishSwedish
The 2007 project
The ALLC has generously provided funding of £10,0000 to
oil the wheels and we should now be able to deliver in
French, Spanish, German, Chinese and Japanese by this time next year,
producing:
translated desc and gloss textsa web application to allow review and update of translationsa mechanism to allow users to easily take advantage of the worktranslated element and attribute names
Infrastructure work
Roma has been changed to support the following output schemes:
English names, descriptions in English English names, descriptions in chosen
language names designed to make sense to a speaker of the chosen language, descriptions in English both names and descriptions in chosen
language
Partners
Weining Huang, Würzburg; checking by Marcus BingenheimerCoordinated by Pierre-Yves Duchesmin
(ENSSIB) for AFNORWerner Wegstein, Würzburg; checking by
Christian WitternOhya Kazushi, Tsurumi University? University of AlicanteSebastian Rahtz