Report on TEI Internationalization Sebastian Rahtz October 2006 The Text Encoding Initiative

No source

TEI internationalisation project

The aim is to lower the barrier for entry of non-English-speaking users by: Ensuring that all of the TEI is Unicode-safe Translating the reference element and attribute descriptions Providing tools to easily access non-English versions Localizing TEI software Localizing the TEI examples Translating the running prose of the Guidelines Translating the object names

Reminder: definitions

Internationalization is the process of generalizing a product so that it can handle multiple languages and cultural conventions without the need for redesign. Internationalization takes place at the level of program design and document development. Localization is the process of taking a product and making it linguistically and culturally appropriate to a given target locale (country/region and language) where it will be used.

Examples of translation instead of addrLine, the TEI user might prefer to write l�-neaDirecci�"n, ligneAdresse, linDireccio or AdressZeile. instead of supplies the descriptive and declarative information making up an electronic title page prefixed to every TEI-conformant text., the French-speaking user might find it more helpful to read L'élément En-tête TEI TeiHeader fournit les informations descriptives et déclaratives précédant chaque texte conforme � la TEI et qui permettent de constituer une page de titre électronique
Localisation of examples

What does this Sire Thopas was a doghty swayn; White was his face as payndemayn, His lippes rede as rose; His rode is lyk scarlet in grayn, And I yow telle in good certayn, He hadde a semely nose. mean to a Chinese scholar?

Example of translated ODD TEI HeaderTEIヘ�*ー tei標� - En-tête TEI supplies the descriptive and declarative information making up an electronic title page prefixed to every TEI-conformant text. TEI準�- テ�-�'トに付与さ�'�-�'電�-�版のタイトルページを �-成す�-�'記述的・宣�*的情� �を含�*. 在�*有符合TEI標準的文本起�-的電�-��'名� �當�-提供敘述�*�以及宣告�*�的�"�訊�*� L'élément En-tête TEI TeiHeader fournit les informations descriptives et déclaratives précédant chaque texte conforme � la TEI et qui permettent de constituer une page de titre électronique. specifies the kind of document to which the header is attached.当�"ヘ�*ー�'付与さ�'�-文書の種類を示す. Spécifie le type de document auquel l'en-tête se rapporte. text the header is attached to a single text. L'en-tête se rapporte � un texte isolé. the header is attached to a corpus. L'en-tête se rapporte � un corpus. Shakespeare: the first folio (1623) in electronic form Shakespeare, William (1564�*�1616) Originally prepared by Trevor Howard-Hill Revised and edited by Christine Avern-Carr Oxford Text Archive
13 Banbury Road, Oxford OX2 6NN, UK
119

Freely available on a non-commercial basis.

1968
The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile, 1968)

Originally prepared for use in the production of a series of old-spelling concordances in 1968, this text was extensively checked and revised for use during the editing of the new Oxford Shakespeare (Wells and Taylor, 1989).

Turned letters are silently corrected.

Original spelling and typography is retained, except that long s and ligatured forms are not encoded.

A reference is created by assembling the following, in the reverse order as that listed here: the n value of the preceding lba periodthe n value of the ancestor div2a spacethe n value of the parent div1

12 Apr 89 Last checked by CAC 1 Mar 89 LB made new file
Shakespeare: the first folio (1623) sous forme électronique Shakespeare, William (1564�*�1616) Préparé par Trevor Howard-Hill Révisé et édité par Christine Avern-Carr Oxford Text Archive
13 Banbury Road, Oxford OX2 6NN, UK
119

Disponible gratuitement � des fins non commerciales.

1968
The first folio of Shakespeare, préparé par Charlton Hinman (The Norton Facsimile, 1968)

Préparé pour la production d'une collection de concordances old-spelling en 1968, ce texte a été profondément relu et révisé pour l'édition du new Oxford Shakespeare (Wells and Taylor, 1989).

Les caractères bloqués sont corrigés sans commentaire.

L'orthographe et la typographie originales sont conservées, � l'exception des s longs et des ligatures qui ne sont pas encodées.

Une référence est créée en assemblant les éléments suivants dans l'ordre inverse de la liste suivante : la valeur de l'attribut n de l'élément lbprécédent.un pointla valeur de l'attribut n de l'élément div2 ancêtre.un espacela valeur de l'attribut n de l'élément div1 parent.

12 avril 1989 Dernière vérification par CAC 1er mars 1989 Nouveau fichier par LB

One of the few elements unconditionally required in any

Un des rares éléments obligatoires dans tout document TEI.

The scale of work involved 494 elements 116 classes 476 attributes 1115 gloss elements, 29357 characters 1170 desc elements, 98415 characters
Who's up for this?

Volunteers for the following languages have stepped up: Chinese Dutch French German Hindi Italian Japanese Polish Portuguese Romanian Serbian Slovenian Spanish Swedish

The 2007 project

The ALLC has generously provided funding of £10,0000 to oil the wheels and we should now be able to deliver in French, Spanish, German, Chinese and Japanese by this time next year, producing: translated desc and gloss texts a web application to allow review and update of translations a mechanism to allow users to easily take advantage of the work translated element and attribute names

Infrastructure work

Roma has been changed to support the following output schemes: English names, descriptions in English English names, descriptions in chosen language names designed to make sense to a speaker of the chosen language, descriptions in English both names and descriptions in chosen language

Partners Weining Huang, Würzburg; checking by Marcus Bingenheimer Coordinated by Pierre-Yves Duchesmin (ENSSIB) for AFNOR Werner Wegstein, Würzburg; checking by Christian Wittern Ohya Kazushi, Tsurumi University ? University of Alicante Sebastian Rahtz