Ce document constitue une introduction aux Recommandations de la Text Encoding Initiative (TEI). Il décrit un sous-ensemble spécifique du schéma complet TEI. Il peut être utilisé pour encoder une grande variété de caractéristiques textuelles courantes, ce qui permet d’une part de maximiser l’utilisation (et la réutilisation) de transcriptions électroniques et d’autre part de faciliter les échanges entre des chercheurs utilisant des ordinateurs de systèmes différents. Ce schéma est entièrement compatible avec le schéma complet TEI, tel que celui-ci est défini dans TEI P5, Guidelines for Electronic Text Encoding and Interchange, février 2006, documents disponibles sur le site web du Consortium :http://www.tei-c.org

1 Introduction

Les Recommandations de la Text Encoding Initiative (TEI) s’adressent à tous ceux qui souhaitent échanger des informations stockées sous forme électronique. Elles mettent l’accent sur l’échange d’informations textuelles, mais d’autres types de données, tels que les images et les sons, sont également pris en compte. Les Recommandations s’appliquent aussi bien à la création de nouvelles ressources qu’aux échanges de ressources existantes.

Les Recommandations fournissent les moyens de rendre explicites certaines caractéristiques d’un texte, de façon à faciliter le traitement de ce texte par des logiciels fonctionnant sur des plateformes différentes. Cette tâche d’explicitation est appelée balisage ou encodage. Toute représentation textuelle met en jeu une forme ou une autre de balisage ; la TEI a été créée en partie pour résoudre les questions d’incommunicabilité entre ces formats d’encodage, et en partie à cause de l’éventail toujours plus diversifié et toujours plus important des usages scientifiques des textes électroniques.

Les Recommandations de la TEI décrivent un schéma d’encodage, qui peut s’exprimer dans différents langages formels. Dans les premières éditions, les Recommandations utilisaient SGML (Standard Generalized Markup Language) ; depuis 2002, celui-ci a été remplacé par XML (Extensible Markup Language). Ces langages partagent en commun la capacité de pouvoir définir un texte en termes d’éléments, d’attributs, et de règles régissant leur apparence dans un texte. L’utilisation que la TEI fait de XML est ambitieuse dans sa complexité et dans sa généralité. Mais elle n’est pas fondamentalement différente de tous les autres schémas XML proposés. N’importe quel logiciel fondé sur un XML généraliste peut ainsi traiter des textes conformes à la TEI.

Depuis 2001, la TEI est un projet collectif s’appuyant sur un consortium international. Au départ, il s’agissait d’un projet de recherche soutenu par l’Association for Computers and the Humanities, l’Association for Computational Linguistics, et l’Association for Literary and Linguistic Computing. Il a été financé ses cinq premières années par le U.S. National Endowment for the Humanities, le Directorate General XIII of the Commission of the European Communities, la Fondation Andrew W. Mellon, le Social Science and Humanities Research Council of Canada entre autres. Les Recommandations ont été publiées la première fois en mai 1994, après 6 ans de travaux impliquant de nombreux chercheurs du monde entier, issus de différentes disciplines.

Durant les années qui ont suivi, les Recommandations ont eu beaucoup d’influence dans le développement des bibliothèques numériques, dans le domaine des industries de la langue, et même dans le développement du web lui-même. Le Consortium TEI a été mis en place en janvier 2001. Une année plus tard, il publiait des Recommandations entièrement révisées, et désormais fondées sur XML. En 2004, le Consortium procédait à une refonte majeure des Recommandations, pour tirer pleinement parti des nouveaux langages à schéma. Une première version a été publiée en 2005. La révision du document TEI Lite est conforme à la version 2.1. des Recommandations, soit la version TEI P5 la plus récente, qui est sortie en juin 2012.

A l’origine, les objectifs généraux de la TEI ont été définis par la résolution finale de la conférence préparatoire, qui s’est tenue au Vassar College (New-York), en novembre 1987. Ces « Principes de Poughkeepsie » ont été ensuite précisés dans une série de documents, qui établissaient notamment que les Recommandations, devaient :

être suffisamment précises pour représenter les caractéristiques textuelles d’un texte, intéressant des chercheurs ;
être simples, claires et concrètes ;
être utilisables facilement par les chercheurs et ne pas nécessiter l'utilisation de logiciels spécifiques ;
permettre une définition rigoureuse et des traitements efficaces des textes ;
prévoir des extensions définies par l’utilisateur ;
respecter les standards existants ou émergents.

Le monde de la recherche est vaste et divers. Pour que les Recommandations suscitent une large adhésion, il était important de s’assurer que :

le noyau commun de caractéristiques textuelles soit facile à partager ;
des traits spécialisés soient faciles à ajouter (ou à supprimer) ;
des encodages multiples et parallèles d’une même caractéristique soient possibles ;
la richesse du balisage puisse être déterminée par l’utilisateur, et ce, avec un minimum de connaissances ;
une documentation pertinente relative au texte et à son encodage soit fournie.

Le présent document, qui se nomme TEI Lite, décrit un sous-ensemble des éléments et des recommandations de la TEI, qui ont été établis à partir des objectifs et des principes rappelés ci-dessus.

Avec la sélection opérée parmi les centaines d’éléments définis dans le schéma TEI complet, nous nous sommes efforcés d’identifier un sous-ensemble, comprenant des éléments que (presque) tous les utilisateurs devraient connaître.

Ce sous-ensemble doit répondre aux objectifs suivants :

permettre d’appréhender adéquatement une variété raisonnablement grande de textes, avec le degré de précision des pratiques existantes (cf. par exemple, le traitement des collections d’Oxford Text Archive)
être utile à la production de nouveaux documents (tel que celui-ci), mais aussi à l’encodage de textes existants
être utilisable par une grande variété de logiciels XML existants
être dérivé du schéma TEI complet et être défini en se conformant aux méthodes d’adaptation décrites dans les Recommandations
être aussi simple et petit que possible, en respectant les autres buts.

Le lecteur jugera jusqu'à quel point nous avons realisé ces objectifs.

Bien que nous ayons essayé de rendre ce document indépendant, tel que l’est un tutoriel, le lecteur doit être conscient qu’il ne décrit pas en détail la totalité de la TEI. Tous les éléments décrits ici sont entièrement documentés dans les Recommandations. Celles-ci constituent la documentation de référence, qui doit être consultée aussi bien pour les éléments décrits ici que pour tous les autres qui ne sont pas décrits. On suppose que le lecteur a un savoir de base à propos de XML.

2 Un court exemple

Nous commençons par un bref exemple¹ Son but est d’illustrer ce qui se passe quand un passage en prose est saisi sur ordinateur par une personne qui n’est pas au fait du balisage ou du potentiel des textes électroniques. Dans un monde idéal, un tel résultat pourrait être généré par un scanner optique très précis. Il tente de rester fidèle à l’apparence du texte imprimé, en retenant les mêmes fins de lignes que l’original, en insérant des blancs pour représenter la disposition des titres originaux et les fins de page, et ainsi de suite, et en gardant la ponctuation originelle.

Figure 1. Transcription naive d'un texte imprimé

Cette transcription présente quelques défauts :

les numéros de pages et les titres se mêlent au texte de manière telle qu’un logiciel aurait beaucoup de difficultés à les « démêler » ;
avec le maintien des césures, toute procédure de recherche peu élaborée ne pourra pas trouver les mots coupés ;
la division en paragraphes est indiquée seulement par des espaces, et des retours à la ligne ont été insérés à la fin de chaque ligne. Tout changement dans la taille de la police, entrainera de facto des problèmes de mise en page ;
les guillemets ont été retenus, mais ils n’identifient pas précisément les passages au discours direct. Par exemple la mention « John était un vieux serviteur etc. » n'est pas prononcé par John, bien qu'entouré de guillemets.

Nous présentons maintenant le même passage, avec un encodage conforme aux Recommandations. Comme nous allons le voir, cet encodage pourrait être étendu de bien des façons, mais la TEI nous permet, au minimum, de représenter les distinctions suivantes :

la division en paragraphes et en chapitre est maintenant indiquée explicitement ;
les débuts de page ont été balisées par l’élément vide pb ;
les lignes du document original n’ont pas été retenues et les césures ont été supprimées, sans qu’il y ait de commentaire ;
pour faciliter la correction, une nouvelle ligne a été insérée au début de chaque paragraphe, et l’alinéa a été supprimé ;
les mots faisant partie d'un discours direct sont balisés explicitement ;
l'usage d'une police différente pour le titre du chapitre est indiqué; par contre, les titres courants ont été supprimés.

<pb n="239"/>
<div n="XXXVIII" type="chapitre">
<head rend="petitMajuscules">conclusion.</head>
<p>J’ai enfin épousé M. Rochester. Notre mariage se fit sans bruit; lui, moi, le
   ministre et le clerc, étions seuls présents. Quand nous revînmes de l’église, j’entrai
   dans la cuisine, où Marie préparait le dîner, tandis que John nettoyait les couteaux. </p>
<p> «<q>Marie</q>, dis-je, <q>j’ai été mariée ce matin à M. Rochester.</q> » </p>
<p>La femme de charge et son mari appartenaient à cette classe de gens discrets et
   réservés auxquels on peut toujours communiquer une nouvelle importante sans crainte
   d’avoir les oreilles percées par des exclamations aiguës, ni d’avoir à supporter un
   torrent de surprises. Marie leva les yeux et me regarda. Pendant quelques minutes elle
   tint suspendue en l’air la cuiller dont elle se servait pour arroser deux poulets qui
   cuisaient devant le feu, et John cessa de polir ses couteaux. Enfin Marie, se penchant
   vers son rôti, me dit simplement : </p>
<p>« <q>En vérité, mademoiselle ? Eh bien, tant mieux, certainement.</q> » </p>
<p>Au bout de quelque temps elle ajouta : « <q>Je vous ai bien vue sortir avec mon
     maître ; mais je ne savais pas que vous alliez à l’église pour vous marier.</q> » </p>
<p>Et elle continua d’arroser son rôti. </p>
<p>Quand je me tournai vers John, je vis qu’il ouvrait la bouche si grande qu’elle
   menaçait d’aller rejoindre ses oreilles. </p>
<p> « <q>J’avais bien averti Marie que cela arriverait,</q> dit-il. <q>Je savais que M.
     Édouard</q> (John était un vieux serviteur et avait connu son maître alors qu’il
   était encore cadet de famille ; c’est pourquoi il l’appelait souvent par son nom de
   baptême), <q>je savais que M. Édouard le ferait, et j’étais persuadé qu’il
     n’attendrait pas longtemps ; je suis sûr qu’il a bien fait.</q> » </p>
<p>En disant ces mots, John tira poliment ses cheveux de devant. </p>
<p> « <q>Merci, John,</q> répondis-je. <q>Tenez, M. Rochester m’a dit de vous donner
     ceci, à vous et à Marie.</q> » Et je lui remis un billet de cinq livres. </p>
<p> Sans plus attendre je quittai la cuisine. Quelque temps après, en repassant devant
   la porte, j’entendis les mots suivants : </p>
<p> « <q>Elle lui conviendra mieux qu’une grande dame.</q> » Puis : « <q>Il <pb n="240"/> y en a de plus jolies, mais elle est bonne et n’a pas de défauts. Du reste, il
     est facile de voir qu’elle lui semble bien belle.</q> » </p>
<p> J’écrivis immédiatement à Moor-House, pour annoncer ce que j’avais fait. Je donnai
   toutes les explications nécessaires dans ma lettre. Diana et Marie m’approuvèrent
   entièrement. Diana m’annonça qu’elle viendrait me voir après la lune de miel. </p>
<p> « <q>Elle ferait mieux de ne pas attendre jusque-là, Jane,</q> me dit M. Rochester,
   lorsque je lui lus la lettre ; <q>car la lune de miel brillera sur toute notre vie, et
     ses rayons ne s’éteindront que sur votre tombe ou sur la mienne.</q> » </p>
<p>...</p>
</div>

L’encodage ci-dessus a été construit sur un ensemble de choix et de priorités : nous avons ici considéré que le texte de Brontë en tant que tel était plus important que son édition. Nous avons ainsi supprimé le tiret de césure, sans pour autant l’avoir marqué (cf. par exemple « de de-vant » dans l’édition, « de devant » dans l’encodage). Encoder c’est donc toujours faire des choix. Un encodage explicite seulement les caractéristiques textuelles qui importent à l’encodeur. Et il est facile d’imaginer différentes manières d’étendre l’encodage d’un passage même aussi court. Par exemple :

des gloses ou des commentaires en notes de bas de page pourraient être ajoutés ;
des liens associant des parties de ce texte à d’autres pourraient être ajoutés ;
on pourrait faire ressortir du contexte des noms propres de différentes sortes ;
des données bibliographiques détaillées sur la provenance du texte et son contexte pourraient être ajoutées dans l’en-tête ;
une analyse linguistique en phrases, en propositions, en mots, etc. pourrait être fournie, où chaque unité serait pourvue de sa catégorie ;
le texte pourrait être segmenté en unités narratives ou en unités discursives ;
on pourrait annoter chaque passage de discours pour comparer le discours des personnages de différentes classes ou de différents sexes ;
on pourrait inclure dans l’encodage l’analyse systématique ou l’interprétation du texte, et ce avec des alignements et des associations potentiellement complexes entre le texte et l’analyse, ou entre le texte et une ou plusieurs de ses traductions ;
des passages pourraient être liés à des images ou à du son, stockés sur d'autres supports.

L’encodage de la plupart de ces extensions est décrit dans la suite du document. Le schéma TEI dans sa totalité fournit également un large éventail d’autres possibilités. Nous en citerons uniquement quelques-unes :

analyse détaillée des constituants d’un nom ;
méta-informations détaillées sur les origines du texte et sur ses thèmes ;
informations sur l’histoire de son impression, ou sur ses variantes manuscrites, telles qu’on pourrait les observer dans les différentes versions du même texte.

Pour savoir comment procéder dans ces différents cas, mais aussi pour connaître beaucoup d’autres possibilités, on se reportera aux Recommandations.

3 La structure d’un texte TEI

Tous les textes qui se conforment à la TEI comprennent : (a) un en-tête TEI (balisé avec l’élément teiHeader) (b) la transcription du texte (balisé avec l’élément text). Ces deux éléments sont associés pour former un seul élément TEI, qui doit être déclaré dans l’espace de noms TEI².

L’en-tête TEI fournit des informations analogues à celles de la page de titre d’un texte imprimé. Il comporte au plus 4 parties :

une description bibliographique
une description portant sur la manière dont il a été encodé (traîtement des césures, guillemets, etc.)
une description « non-bibliographique » du texte (un text profile)
l’historique des modifications et révisions

L’en-tête est décrit de manière détaillée dans la section 19 La page titre électronique.

Un document TEI peut être unitaire (une œuvre) ou composite (un recueil d’œuvres, comme une anthologie). Dans les deux cas, le texte peut comporter des Parties liminaires ou des Annexes. Entre les deux, se trouve le corps du texte. Dans un texte composite, le corps peut rassembler des groupes, chaque groupe contenant d’autres groupes ou des textes.

Un texte unitaire sera encodé en utilisant une structure comme celle-ci :

Un texte composite peut également comporter des Parties liminaires ou des Annexes. Entre les deux, on aura un ou plusieurs groupes de textes, chaque groupe pouvant avoir ses propres Parties liminaires et ses Annexes. Un texte composite sera alors encodé en utilisant une structure comme celle-ci :

Il est aussi possible de définir un texte composite rassemblant des textes TEI complets, chacun d’entre eux étant pourvu de son propre en-tête. Un tel recueil est appelé TEI Corpus, et peut avoir son propre en-tête:

Il est également possible de créer un ensemble de corpus - un élément teiCorpus peut contenir d’autres éléments teiCorpus plutôt que des éléments TEI, et ce quelle que soit la profondeur.

Dans la suite du document, nous discuterons surtout des structures textuelles simples. Dans chaque cas, nous présenterons a) les éléments TEI pertinents, avec pour chacun une brève définition ; b) les attributs spécifiques de chaque élément, et leur définition ; c) la référence à la classe dont l’élément est un membre. Ces références sont associées aux spécifications complètes de chaque objet, comme cela est présenté dans les Recommandations. La plupart des cas sont illustrés par de brefs exemples.

Par exemple, voici les éléments dont on a discuté jusqu’à maintenant :

TEI (TEI document) contains a single TEI-conformant document, comprising a TEI header and a text, either in isolation or as part of a teiCorpus element.
teiHeader (TEI header) supplies the descriptive and declarative information making up an electronic title page prefixed to every TEI-conformant text.
text contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample.
teiCorpus contains the whole of a TEI encoded corpus, comprising a single corpus header and one or more TEI elements, each containing a single text header and a text.

4 L’encodage du corps du texte

Comme nous l’avons indiqué, un document TEI simple rassemble les éléments suivants :

front (front matter) contains any prefatory matter (headers, title page, prefaces, dedications, etc.) found at the start of a document, before the main body.
group contains the body of a composite text, grouping together a sequence of distinct texts (or groups of such texts) which are regarded as a unit for some purpose, for example the collected works of an author, a sequence of prose essays, etc.
body (text body) contains the whole body of a single unitary text, excluding any front or back matter.
back (back matter) contains any appendixes, etc. following the main part of a text.

Les éléments traitant spécifiquement des Parties liminaires et des Annexes sont décrits ci-dessous dans la section 18 Les Parties liminaires et les Annexes. Dans cette section, nous discutons des éléments constituant le corps d’un texte.

4.1 Les éléments marquant la division du texte

Le corps d’un texte en prose peut être une simple suite de paragraphes ; ou bien, ces paragraphes peuvent être regroupés en chapitres, sections, sous-sections, etc. Chaque paragraphe est balisé par la balise p. L’élément div indique un regroupement de paragraphes.

On utilise l’attribut type de l’élément div pour indiquer la dénomination usuelle de la division, ou sinon pour distinguer plusieurs divisions. Typiquement, ses valeurs sont « livre », « chapitre », « section », « partie », « poème », « chanson », etc. Dans un projet particulier, il est conseillé de définir la liste des valeurs possibles et de s’y tenir.

Un élément div peut contenir des div enchâssés, reflétant ainsi la structure traditionnelle d’un livre : il peut être décomposé hiérarchiquement en parties, chapitres, sections, etc. Les textes TEI se conforme généralement à un modèle hiérarchique simple.

On utilise l’attribut xml:id pour indiquer l’identifiant unique de la division. Ce qui est utile pour faire des références croisées ou simplement faire référence à une division donnée, via un commentaire par exemple. Voir aussi ci-dessous 8 Les références croisées et les liens Il est souvent utile d’associer l’attribut xml:id à chaque unité structurelle importante du texte, et d’établir ses valeurs de manière systématique. Par exemple en ajoutant un numéro de section à un code court pour faire référence au titre de l’œuvre. C’est aussi particulièrement utile si la ressource a vocation à être disponible sur le web : les autres applications web pourront facilement pointer directement les parties du texte.

On utilise l’attribut n pour attribuer (de manière additionnelle ou de manière alternative) un nom mnémotechnique ou un nombre à une division, ou bien à n’importe quel autre élément.

On utilise l’attribut xml:lang pour spécifier la langue d’une division. Les langues sont identifiées par un code défini au niveau international. Voir ci-dessous.

On utilise l’attribut rend pour spécifier l’apparence d’une division, ou de n’importe quel autre élément. Voir ci-dessous. Tout comme avec l’attribut type, il est souvent utile de prédéfinir les valeurs possibles de cet attribut. La TEI Lite ne contraint pas l’expression des valeurs de rend.

Ces 4 attributs xml:id, n, xml:lang et rend sont si largement utilisés qu’ils s’appliquent à n’importe quel élément du schéma TEI : ce sont des attributs globaux. D’autres attributs globaux définis dans la TEI Lite sont discutés dans la section 8.3 Les types spéciaux de liens.

La valeur de chaque attribut xml:id doit être unique dans un même document. Une manière simple de s’en assurer est de vérifier qu’elle reflète la structure hiérarchique du document. Par exemple, prenons le roman Les Misérables de Victor Hugo, qui est divisé en cinq tomes, dont chacun contient plusieurs ‘livres’, chaque livre comprenant des chapitres. L’attribut xml:id pourrait avoir la structure suivante :

Un système de numérotation différent peut être utilisé pour les attributs xml:id et n, ce qui est souvent utile lorsqu’un schéma de référence ne concorde pas avec la structure de l’œuvre. Par exemple, prenons un roman en plusieurs tomes, chaque tome comprend des chapitres numérotés en fonction de l’ensemble des tomes (et non à l’intérieur de chaque tome). On pourrait utiliser le schéma suivant :

Ici, l’œuvre comprend toujours deux volumes, chaque volume contenant deux chapitres. Les chapitres ont été numérotés de 1 à 4, mais les valeurs de xml:id, qui ont été spécifiées, permettent en outre de les considérer comme s’ils avaient été numérotés 1.1, 1.2, 2.1, 2.2.

4.2 Les têtes et fins de chapitre

Chaque div peut commencer par un titre ou une tête de chapitre, et se terminer (moins couramment) par une formule de fermeture, telle que « Fin du chapitre 1 ». On utilise alors les éléments suivants :

head (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc.
trailer contains a closing title or footer appearing at the end of a division of a text.

D’autres éléments qui peuvent être nécessaires au début ou à la fin d’une division sont discutés dans la section 18.1.2 Différents types de liminaires.

Que les têtes de chapitre et fins de chapitre soient inclus ou non dans la transcription relève du choix de l’encodeur. Quand une tête de chapitre est complètement régulière (par exemple « Chapitre 1 ») ou peut être automatiquement générée à partir de la valeur des attributs (par exemple <div type="chapter" n="1">), on peut l’omettre. Mais quand elle contient du texte qu’on ne peut retrouver autrement, on doit toujours ajouter le texte. Par exemple, le début de Notre-Dame de Paris de Victor Hugo pourrait être encodé comme suit:

<div xml:id="NDP01" n="I" type="livre">
<head>Livre premier</head>
<div n="I.1 " type="chapitre">
  <head>La Grand'salle</head>
  <p>Il y a aujourd'hui trois cent quarante-huit ans six mois et dix-neuf jours que
     les parisiens s'éveillèrent au bruit de toutes les cloches sonnant à grande
     volée dans la triple enceinte de la Cité, de l'Université et de la Ville. </p>

</div>
</div>

4.3 La prose, les vers et le théâtre

Comme dans l’exemple de Brontë ci-dessus, les paragraphes formant une division textuelle sont balisés avec p. Les textes poétiques ou le théâtre demandent d’autres balises, pour représenter par exemple les vers, les strophes dans le premier cas, le discours des individus et les indications scéniques dans le second.

l (verse line) contains a single, possibly incomplete, line of verse.
lg (line group) contains one or more verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc.
sp (speech) contains an individual speech in a performance text, or a passage presented as such in a prose or verse text.
speaker contains a specialized form of heading or label, giving the name of one or more speakers in a dramatic text or fragment.
stage (stage direction) contains any kind of stage direction within a dramatic text or fragment.

Par exemple, voici un poème de Baudelaire où les vers et les strophes ont été balisés :

<text>
<body>
  <lg type="quatrain">
   <l>Les amoureux fervents et les savants austères</l>
   <l>Aiment également, dans leur mûre saison,</l>
   <l>Les chats puissants et doux, orgueil de la maison,</l>
   <l>Qui comme eux sont frileux et comme eux sédentaires.</l>
  </lg>
  <lg type="quatrain">
   <l>Amis de la science et de la volupté</l>
   <l>Ils cherchent le silence et l'horreur des ténèbres ;</l>
   <l>L'Erèbe les eût pris pour ses coursiers funèbres,</l>
   <l>S'ils pouvaient au servage incliner leur fierté.</l>
  </lg>
  <lg type="tercet">
   <l>Ils prennent en songeant les nobles attitudes</l>
   <l>Des grands sphinx allongés au fond des solitudes,</l>
   <l>Qui semblent s'endormir dans un rêve sans fin ;</l>
  </lg>
  <lg type="tercet">
   <l>Leurs reins féconds sont pleins d'étincelles magiques,</l>
   <l>Et des parcelles d'or, ainsi qu'un sable fin,</l>
   <l>Etoilent vaguement leurs prunelles mystiques.</l>
  </lg>
</body>
</text>

Notons que l’élément l indique la ligne du vers et non la ligne typographique. La disposition en lignes des premiers vers, ci-dessus, n’a pas été explicitement encodée, et risque donc d'être perdue. On peut utiliser l’élément lb, décrit dans la section 5 Les numéros de pages et les numéros de ligne, si l’on souhaite aussi marquer le début d’une ligne typographique.

Voici la fin d’une pièce de théâtre de Samuel Beckett, dans laquelle les répliques et les indications scéniques sont encodées :

<sp>
<speaker>Vladimir</speaker>
<p>Relève ton pantalon.</p>
</sp>
<sp>
<speaker>Estragon</speaker>
<p>Que j'enlève mon pantalon ?</p>
</sp>
<sp>
<speaker>Vladimir</speaker>
<p>
<emph>RE</emph>lève ton pantalon.</p>
</sp>
<sp>
<speaker>Vladimir</speaker>
<p> C'est vrai. </p>
<stage>Il relève son pantalon. Silence.</stage>
</sp>
<sp>
<speaker>Vladimir</speaker>
<p>Alors, on y va ?</p>
</sp>
<sp>
<speaker>Estragon</speaker>
<p>Allons-y.</p>
</sp>
<stage>Ils ne bougent pas.</stage>

Notons que l’élément stage (‘indication scénique’) peut apparaître ou bien dans la réplique ou bien entre deux répliques. L’élément sp (speech ou énoncé) contient, après l’élément optionnel speaker, qui indique qui parle, ou bien des paragraphes (si la réplique est en prose) ou bien des vers ou des strophes (comme dans l’exemple suivant). Dans ce cas, il est assez courant de trouver des vers coupés. La façon la plus simple d’encoder ceci est d’utiliser l’attribut part pour indiquer qu’une ligne (ou un ensemble de lignes) est incomplète de point de vue métrique comme par exemple dans ce passage de Molière :

<sp>
<speaker>Alceste.</speaker>
<l>Non : j' ai résolu de n' en pas faire un pas.</l>
<l part="I">J' ai tort, ou j' ai raison.</l>
</sp>
<sp>
<speaker>Philinte.</speaker>
<l part="F">Ne vous y fiez pas.</l>
</sp>
<sp>
<speaker>Alceste.</speaker>
<l part="I">Je ne remuerai point.</l>
</sp>
<sp>
<speaker>Philinte.</speaker>
<l part="F">Votre partie est forte,</l>
<l part="I">et peut, par sa cabale, entraîner...</l>
</sp>
<sp>
<speaker>Alceste.</speaker>
<l part="F">Il n' importe.</l>
</sp>
<sp>
<speaker>Philinte.</speaker>
<l part="I">Vous vous tromperez.</l>
</sp>
<sp>
<speaker>Alceste.</speaker>
<l part="F">Soit. J' en veux voir le succès.</l>
</sp>

Le même mécanisme peut être appliqué à des strophes qui sont divisées entre deux locuteurs, par exemple³ :

<div>
<sp>
  <speaker>PREMIERE VOIX</speaker>
  <lg type="stanza" part="I">
   <l>« Mais qu’est-ce qui fait filer ce bateau</l>
   <l>Sans vague, sans un souffle d’air ? »</l>
  </lg>
</sp>
<sp>
  <speaker>SECONDE VOIX</speaker>
  <lg part="F">
   <l>« L’air se fend devant lui à son approche,</l>
   <l>Et puis se referme derrière.</l>
  </lg>
</sp>

</div>

On peut également utiliser l’élément sp pour le dialogue en prose, comme s’il s’agissait de théâtre. Voir l’exemple⁴ ci-dessous, qui fait usage de l’attribut who. Cet attribut porte un code permettant d’identifier le locuteur dans le dialogue.

<p> Peu à peu les voix du salon
s’affaiblirent ; la comtesse resta seule avec une de ses amies, mademoiselle Fanny
B***. Toutes deux se trouvèrent bientôt dans la chambre et devant mes yeux.</p>
<sp who="#FAN">
<speaker>FANNY.</speaker>
<p>Quel fâcheux contretemps ! La pluie tombe à torrents, et pas une voiture
   !</p>
</sp>
<sp who="#GAM">
<speaker>GAMIANI.</speaker>
<p>Je suis désolée comme vous ; par malencontre, ma voiture est chez le
   sellier.</p>
</sp>
<sp who="#FAN">
<speaker>FANNY.</speaker>
<p>Ma mère sera inquiète.</p>
</sp>
<sp who="#GAM">
<speaker>GAMIANI.</speaker>
<p>Soyez sans crainte, ma chère Fanny, votre mère est prévenue ; elle sait que vous
   passez la nuit chez moi. Je vous donne l’hospitalité.</p>
</sp>

Dans ce cas, les valeurs de l'attribut who (#GAM par exemple) pointent sur une liste des personnages du roman, chacun ayant un identifiant unique correspondant:

<list>
<head>Caractères du roman</head>
<item xml:id="FAN">
<name>Fanny</name> B. : jeune victime fictive </item>
<item xml:id="GAM">La comtesse <name>Gamiani</name> : héroine du roman, prétendument
une représentation de l'écrivain Amantine Dudevant ("George Sand")</item>
</list>

5 Les numéros de pages et les numéros de ligne

Les coupures de pages, de lignes, etc. peuvent être balisées par les éléments suivants :

pb/ (page break) marks the start of a new page in a paginated document.
lb/ (line break) marks the start of a new (typographic) line in some edition or version of a text.
milestone/ marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element.

Ces éléments indiquent un endroit précis dans le texte et non un empan. On utilise l’attribut global n pour enregistrer le numéro de la page ou de la ligne commençant à la balise.

Pour l'encodage des sources paginées, comme des imprimés ou des manuscrits, il peut être utile d’enregistrer les sauts de page ou de feuillet au point d'apparition dans le flux du texte, afin notamment de lier le texte aux images de pages (fac-similés), de générer automatiquement une référence bibliographique à la page, etc. Enregistrer les sauts de ligne peut aussi être utile à fin de référence, par exemple pour une édition critique avec des notes qui se réfèrent à des numéros de ligne.

On peut enregistrer plusieurs paginations, par exemple celle d'un manuscrit original et celle d'une édition imprimée de référence, en distinguant les sauts de pages grâce à l'attribut ed. Par exemple, dans le passage suivant, nous indiquons où se trouvent les débuts de page dans deux éditions différentes (ED1 et ED2). Nous indiquons également qu'un fac-similé de la page 240 dans l'édition ED1 est disponible dans le fichier p240.png :

<p>
<q>Il <pb n="240" ed="ED1" facs="p240.png"/> y en a de plus jolies, mais elle est
bonne et n’a pas de défauts. Du reste, il est facile de voir qu’elle lui semble bien
belle.</q> »
</p>
<p> J’écrivis immédiatement à Moor-House, <pb n="240" ed="ED2"/> pour annoncer ce que
j’avais fait. Je donnai toutes les explications nécessaires dans ma...</p>

Les éléments pb et lb appartiennent à la classe générale des éléments milestone, qui permet d’indiquer des points de référence dans un texte. L’élément générique milestone sert à indiquer n’importe quel point de référence : par exemple, le début d'une colonne, le début d’un nouveau type de section, qui ne serait pas balisé autrement, et plus généralement, n’importe quel changement significatif dans le texte qui ne serait pas balisé par un élément XML. Pour ces types d’unités ou bien pour les éditions, on utilise les attributs unit et ed. Leurs valeurs sont librement choisies, mais doivent être documentées dans l’élément refsDecl de l’en-tête. Ces attributs ne doivent pas être mélangés de manière arbitraire : soit on utilise l’élément milestone à la place des autres attributs, soit on utilise ces attributs.

6 Le marquage des expressions mises en valeur

6.1 Les changements de police de caractères, etc.

Les mots ou les expressions mis en valeur sont ceux qui sont visuellement différents du reste du texte ; il s’agit typiquement d’un changement de police, de style d’écriture, de couleur, etc., dont le but est d’attirer l’attention du lecteur. On peut se servir de la balise suivante pour signaler de tels changements:

hi (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made.

L’attribut global rend peut être associé à n’importe quel autre élément, et utilisé chaque fois que nécessaire pour spécifier la mise en valeur. Par exemple, un titre en gras peut être balisé par <head rend="bold"> et un titre en italiques par <head rend="italic">.

Les valeurs de l’attribut rend ne sont pas spécifiées dans les Recommandations, car elles dépendent entièrement des besoins du projet. Typiquement, ses valeurs sont « italiques », « gras », « exposant », etc. pour le style de la police ; « centré », « aligné à droite » pour l’alignement ; « grand », « petit » pour la taille ; « petitesMajuscules », « majuscules » pour les caractères, etc. Plusieurs de ces valeurs peuvent être combinées si cela est nécessaire, et il n’y pas de syntaxe particulière pour exprimer ces combinaisons. Les Recommandations proposent aussi des mécanismes d’encodage plus rigoureux que celui offert par rend. Ils reposent sur les standards du W3C (comme les CSS).

Ce n’est pas toujours possible ou souhaitable d’interpréter les raisons qui ont présidé à telle ou telle mise en valeur. On peut alors utiliser l’élément hi pour marquer la séquence mise en valeur, sans pour autant faire d’interprétation quant à son statut.

Dans l’exemple suivant, le fait d’utiliser une police de caractères différente pour le nom présent dans le sous-titre et pour le sous-titre lui-même, est enregistré mais non interprété.

<p>
<hi rend="gothic">Item; il est attesté par le présent acte</hi>, que ledit <hi rend="italic">Walter Shandy</hi>, négociant, en
considération dudit futur mariage
...
</p>

De manière alternative, quand on sait interpréter la mise en valeur, un certain nombre d’autres éléments spécifiques sont disponibles :

emph (emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect.
foreign (foreign) identifies a word or phrase as belonging to some language other than that of the surrounding text.
gloss identifies a phrase or word used to provide a gloss or definition for some other word or phrase.
label contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary.
term contains a single-word, multi-word, or symbolic designation which is regarded as a technical term.
title contains a title for any kind of work.

Pour illustrer les éléments que nous avons définis, arrêtons-nous sur la phrase suivante⁵ : ‘Dans un récent numéro du Bulletin des bibliothèques de France, des collègues affirmaient avec enthousiasme, et non sans bravoure, que les bibliothèques n’avaient pas, loin s’en faut, « disparu de la cité », mais qu’elles se trouvaient au contraire investies d’une « nouvelle pertinence ».’

Si l’on décide d’interpréter la mise en valeur, la phrase pourrait être encodée de la manière suivante :

<p>Dans un récent
numéro du <title>Bulletin des bibliothèques de France</title>, des collègues
affirmaient avec enthousiasme, et non sans bravoure, que les bibliothèques n’avaient
pas, loin s’en faut, « <mentioned>disparu de la cité</mentioned> », mais qu’elles se
trouvaient au contraire investies d’une « <mentioned>nouvelle pertinence</mentioned>
»</p>

Si on est intéressé uniquement par la mise en forme de l'original, on pourrait l'encoder ainsi :

Dans un récent numéro du <hi rend="it">Bulletin des bibliothèques de France</hi>, des
collègues affirmaient avec enthousiasme, et non sans bravoure, que les bibliothèques
n’avaient pas, loin s’en faut, « <hi rend="it">disparu de la cité</hi> », mais
qu’elles se trouvaient au contraire investies d’une « <hi rend="it">nouvelle
pertinence</hi> ».

6.2 Les citations et les éléments associés

Tout comme les changements de police de caractères, les guillemets sont utilisés de différentes manières dans les textes, le cas le plus le plus fréquent étant celui de la citation. Quand c’est possible, il importe de ne pas s’en tenir à noter la présence de guillemets, mais plutôt d’étiqueter leur fonction. On utilise alors les éléments suivants :

q (quoted) contains material which is distinguished from the surrounding text using quotation marks or a similar method, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used.
mentioned marks words or phrases mentioned, not used.
soCalled contains a word or phrase for which the author or narrator indicates a disclaiming of responsibility, for example by the use of scare quotes or italics.
gloss identifies a phrase or word used to provide a gloss or definition for some other word or phrase.

Voici un exemple simple d'une citation, extrait des Confessions de Jean-Jacques Rousseau :

<p>Enfin je me rappelai le pis-aller d’une grande princesse à qui l’on disait que les
paysans n’avaient pas de pain, et qui répondit : <q>Qu’ils mangent de la
brioche.</q>
</p>

Pour indiquer comment la citation a été imprimée (par exemple, sur la même ligne, ou un pavé), on utilise l’attribut rend, qui permet également de spécifier le type de guillemets utilisé.

Le discours direct interrompu par le narrateur peut être simplement représenté en finissant la citation juste avant l’interruption et en la recommençant juste après. Par exemple dans La Quête du Graal (ms K, folio 169, § 55) on lit :

Et quant Melyan voit ces letres si dist a
Galaad : <q>Frans chevaliers por Dieu lessiez moi entrer en cele a senestre, car en
cele porrai je esprover ma force, et connoistre s’il avra ja en moi proesce ne
hardement por quoi je doie avoir los de chevalerie.</q>
<q>- S’il vos pleust</q>, fait Galaad,
<q>je m’en entrasse en cele a senestre, car si
com je pens je m’en getasse mielz que vos.</q>

S’il est important de signifier que les deux éléments q forment ensemble un seul bloc, on peut utiliser les attributs next et prev. Cf. section 8.3 Les types spéciaux de liens.

On peut associer aux citations des références (par exemple la source, le nom du personnage, etc.), en utilisant l’attribut who, que ces informations soient ou non explicitées dans le texte. Par exemple :

<q who="#Wilson"> Spaulding arriva au
bureau, je me rappelle que ce fut précisément il y a
aujourd’hui huit
jours, avec ce même journal à la main et s’écria : <q who="#Spaulding">— Quel
malheur, monsieur Wilson ! que je ne sois pas roux.</q>
</q>

Cet exemple⁶ montre aussi comment les citations peuvent être imbriquées dans d’autres citations : un personnage (Wilson) cite un autre personnage (Spaulding). Le créateur du texte électronique doit décider si les guillemets seront remplacés par des balises, ou si des balises seront ajoutées et les guillemets conservés. Si les guillemets sont supprimés, on peut utiliser l’attribut rend pour décrire les guillemets de la copie papier.

Les Recommandations proposent d’autres éléments permettant de distinguer le discours direct, la citation, et d’autres usages typiques des guillemets, bien qu’il ne soit pas toujours possible ou souhaitable d’interpréter la fonction des guillemets dans un texte. Pour des raisons de simplicité, seul q (qui peut être utilisé dans n’importe quel cas) fait partie de la TEI Lite.

Pour associer une citation bibliographique avec sa source par contre, on peut utiliser l'élément bibl, et emballer les deux dans l'élément cit (voir exemple 11 Les listes).

6.3 Les mots étrangers ou les expressions

Les mots et les expressions qui ne sont pas dans la langue principale du texte peuvent être balisés de deux façons : i) si le mot ou l’expression est déjà balisé pour une raison quelconque par un élément, on associe à ce dernier l’attribut global xml:lang, dont on spécifie la valeur (i.e. de quelle langue il s’agit) ; ii) sinon on utilise l’élément foreign, toujours avec l’attribut xml:lang. Par exemple :

<foreign xml:lang="la">Et vobis</foreign> messieurs,
Ce ne seroyt que bon que nous rendissiez noz cloches...

<p>Avez-vous lu <title xml:lang="de">Die Dreigroschenoper</title>?</p>
<p>On parle des <mentioned xml:lang="en">people</mentioned> dans ce journal</p>
<p>Le <term xml:lang="la">quaternio terminorum</term>, expression venant du latin,
<gloss>l'erreur des quatre termes, </gloss>est un sophisme qui intervient
lorsqu'un syllogisme utilise quatre termes au lieu de trois.</p>

Comme ces exemples le montrent, l’élément foreign ne doit pas être utilisé pour baliser des mots étrangers si un autre élément plus spécifique, tel que title, mentioned, ou term s’applique. L’attribut global xml:lang peut être associé à n’importe quel élément pour exprimer que son contenu n’est pas dans la même langue que celle qui l’entoure.

Les codes utilisés pour identifier les langues, indiquées dans l’attribut xml:lang, doivent se conformer aux standards internet⁷, tels que cela est expliqué dans les Recommandations. Voici quelques exemples de codes :

zh	chinois	grc	grec ancien
en	anglais	el	grec
enm	anglais du moyen age	ja	japonais
fr	francais	la	latin
de	allemand	sa	sanskrit

7 Les notes

Toutes les notes, qu’elles soient imprimées en bas de page, en fin de chapitre ou d’ouvrage, en marge, ou sous n’importe quelle autre forme doivent être balisées avec le même élément :

note contains a note or annotation.

Ceci n’est pas toujours possible, notamment avec les notes en marge, qui peuvent ne pas être associées à un appel de note précis. Pour simplifier, on pourra positionner la note en marge avant le paragraphe ou tout autre élément pertinent. Les notes peuvent également être placées dans une partie séparée du texte (comme le sont les notes de fin de chapitre ou d’ouvrage dans les textes imprimés) et liées à la partie adéquate du texte. On utilise alors l’attribut target.

Si besoin, on utilise l’attribut n pour spécifier le numéro ou l’identifiant de la note. L’attribut resp doit être utilisé de manière systématique pour distinguer les notes de l’auteur et les notes de l’éditeur, si l’œuvre comporte ces deux sortes de note.

Un exemple⁸ :

<p>J'écris dans la<lb/> marge...<lb/> Je vais<lb/> à la ligne.<lb/> Je renvoie à une
note<note type="gloss" place="foot"> J'aime beaucoup les renvois en bas de page,
même si je n'ai rien de particulier à y préciser.</note>en bas de page.</p>

Un autre exemple⁹ :

<lg>
<note place="marge"> Le sortilège commence à se rompre.</note>
<l>Au même instant je pus enfin prier ;</l>
<l>Et de mon cou se libérant,</l>
<l>L’Albatros vint à tomber, et coula</l>
<l>Comme du plomb dans l’océan.</l>
</lg>

8 Les références croisées et les liens

Des références croisées explicites et des liens d’un endroit du texte à un autre, que ce soit dans le même document ou entre deux documents différents, peuvent être encodés en utilisant les éléments décrits dans cette section. Les liens implicites (tels que l’association de deux textes parallèles, ou bien l’association d’un texte et de son interprétation) peuvent être encodés en utilisant les attributs présentés dans la section 8.3 Les types spéciaux de liens.

8.1 Les références croisées simples

Si l’on veut relier deux endroits d’un même texte, on utilise l’un des éléments suivants :

ref (reference) defines a reference to another location, possibly modified by additional text or comment.
ptr/ (pointer) defines a pointer to another location.

L’élément vide ptr marque l’endroit à partir duquel on crée le lien, tandis que ref contient du texte. Le plus souvent il s’agira du texte de la référence croisée elle-même. L’élément ptr doit être utilisé pour une référence croisée signalée par des moyens non verbaux, tels qu’un symbole ou une icône, ou encore un bouton dans un texte électronique. Il est également utile dans les systèmes de production de document, si le système de formatage est capable de générer correctement l’expression de la référence croisée.

Les deux formes suivantes sont logiquement équivalentes :

Cf. tout particulièrement <ref target="#SEC12">la section 12, page 34</ref>.

Cf. tout particulièrement <ptr target="#SEC12"/>.

La valeur de l’attribut target de l’un ou l’autre de ces éléments peut être l’identifiant d’un autre élément du texte courant. Le passage ou l’expression que l’on cible doivent porter un identifiant, et donc être balisés comme un élément d’un certain type. Dans l’exemple suivant, la référence croisée pointe sur un élément div :

... Cf. tout
particulièrement <ptr target="#SEC12"/>. ...
<div xml:id="SEC12">
<head>Au sujet des identifiants</head>

</div>

Comme xml:id est un attribut global, on peut pointer de cette manière sur n’importe quel élément d’un texte TEI. Dans l’exemple suivant, on a associé un identifiant à un paragraphe pour qu’on puisse pointer sur lui.

... voir discussion de ce point au <ref target="#pspec">paragraphe qui traite des liens</ref> ...
<p xml:id="pspec">Les
liens peuvent s'effectuer vers tout type d'élément ...</p>

Parfois, la cible d’une référence croisée ne correspond à aucune caractéristique particulière du texte, et de ce fait peut ne pas être balisée comme un élément d’un certain type. Si la cible que l’on souhaite indiquer correspond simplement à un endroit du document courant, la manière la plus simple de procéder est d’utiliser l’élément anchor, à l’endroit ciblé. Si la cible est une certaine suite de mots, qui n’est pas déjà balisée, l’élément seg peut être utilisé. Ces deux éléments se laissent décrire de la manière suivante :

anchor/ (anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element.
seg (arbitrary segment) represents any segmentation of text below the ‘chunk’ level.

Dans l’exemple (inventé) suivant, les éléments ref sont utilisés pour représenter les endroits du texte qui doivent être associés à d’autres parties du texte ; dans le premier cas, à un endroit particulier, et dans le second à une suite de mots :

Retournant à <ref target="#ABCD">la phrase où je me suis endormi</ref>, j'ai noté que <ref target="#EFGH">trois
mots</ref> avaient été soulignés par un lecteur précédent

Cet encodage demande que les éléments ayant des identifiants spécifiques (ABCD et EFGH dans cet exemple) soient effectivement quelque part dans le texte. Si aucun élément ne porte déjà ces identifiants, on peut utiliser les éléments anchor et seg.

.... <anchor type="bookmark" xml:id="ABCD"/> .... ....<seg type="target" xml:id="EFGH"> ... </seg> ...

On doit utiliser l’attribut type (comme ci-dessus) pour distinguer les différents rôles que pourraient jouer ces éléments généraux. D’autres utilisations sont présentées ci-après.

8.2 Le renvoi à d’autres documents

Jusqu’à maintenant, nous avons montré comment on pouvait utiliser les éléments ptr et ref pour établir des références croisées ou des liens, dont les cibles se situent dans le même texte. Cependant, on peut aussi utiliser ces deux éléments pour renvoyer à des éléments de n’importe quel autre document ou ressource XML, tel qu’un document sur le web, une partie d’une base de données. Ceci est possible si l’attribut target porte comme valeur n’importe quelle URI¹⁰ valide.

Un URI peut renvoyer à une page web, ou seulement à une partie de cette page. Par exemple : dans http://www.tei-c.org/index.xml#SEC2", le signe # indique que ce qui suit est l’identifiant d’un élément situé dans le document XML, dont l’adresse est indiquée par ce qui précède : cet exemple vise par conséquent un élément qui a un attribut xml:id dont la valeur est SEC2, dans le document extrait de http://www.tei-c.org/index.xml. Dans les exemples que nous avons présentés jusqu’à maintenant, la partie à gauche du signe # a toujours été omise, ce qui signifiait justement que l’élément auquel on renvoyait se trouvait dans le même document.

Des parties d’un document XML peuvent être spécifiées par d’autres moyens plus sophistiqués, qui utilisent XPath. Ce langage est aussi défini par le W3C. Il est particulièrement utile quand les éléments à associer ne portent pas d’identifiants et doivent alors être localisés par d’autres moyens.

8.3 Les types spéciaux de liens

Les attributs suivants, qui ont des rôles spécifiques, sont définis pour tous les éléments du schéma TEI Lite :

ana: lie un élément à son interprétation
corresp: lie un élément avec un ou plusieurs autres éléments correspondants
next: lie un élément avec l’élément suivant de l’ensemble
prev: lie un élément avec l’élément précédent de l’ensemble

On utilise l’attribut ana (analysis - analyse) lorsque des ensembles rassemblant des analyses abstraites ou des interprétations ont été définis quelque part dans le document. Cf. 15 L’interprétation et l’analyse. Par exemple, une analyse linguistique de « Jean aime Nancy » pourrait être encodée comme suit :

<seg type="sentence" ana="SVO">
<w ana="#NP1">Jean</w>
<w ana="#VVI">aime</w>
<w ana="#NP1">Nancy</w>
</seg>

Cet encodage implique l’existence quelque part dans le document d’éléments avec les identifiants SVO, NP1 et VVI, où la signification de ces codes particuliers est explicitée. Notons ici l’utilisation de l’élément seg pour marquer les composants particuliers d’une analyse, qui sont distingués par l’attribut type.

On utilise l’attribut corresp (correspondant) pour indiquer d'une manière simple tout type de correspondance entre deux éléments. Par exemple, dans un texte multilingue, il peut servir à lier ensemble les phrases équivalentes :

<seg xml:lang="fr" xml:id="FR1" corresp="#EN1">Jean aime Nancy</seg>
<seg xml:lang="en" xml:id="EN1" corresp="#FR1">John loves Nancy</seg>

On peut utiliser ce même mécanisme dans divers cadres. Dans l’exemple¹¹ qui suit, on l’utilise pour représenter la relation anaphorique entre « le village » et « Châteauvillain », et entre « la fête foraine » et « la fête de la Trinité »

<p>
<name type="place" xml:id="CVN">Châteauvillain</name> renouvelle avec une vieille
tradition de la localité qui avait été un peu oubliée depuis quelques années : <seg xml:id="feteTrin">la fête de la Trinité</seg>. Autrefois, il s'agissait des
grandes réjouissances annuelles de la commune. Pour ce retour attendu, <rs corresp="#feteTrin">la fête foraine</rs> s'installera dans <rs corresp="#CVN">le
village</rs> les 29 et 30 mai.
</p>

Les attributs prev et next permettent de relier de manière simple des constituants discontinus, comme dans l’exemple suivant :

<p> «<q xml:id="Q1a" next="#Q1b">Marie</q>,
dis-je, <q xml:id="Q1b" prev="#Q1a">j’ai été mariée ce matin à M. Rochester.</q> »
</p>.

9 Les interventions éditoriales

L’encodage d’un texte électronique a beaucoup à voir avec l’édition d’un manuscrit ou celle d’un texte destiné à être publié. Dans les deux cas, un éditeur consciencieux peut vouloir conserver l’état original de la source et la trace des corrections ou des changements qu’il a opérés. Pour ce faire, on utilisera les éléments décrits dans cette section et dans la section suivante,

9.1 La correction et la normalisation

On utilise les éléments suivants pour indiquer une correction, soit des changements éditoriaux introduits par l’éditeur, qui pense que l’original présente des erreurs :

corr (correction) contains the correct form of a passage apparently erroneous in the copy text.
sic (Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate.

On utilise les éléments suivants pour indiquer une standardisation, soit des changements éditoriaux introduits pour des raisons de cohérence ou de modernisation d’un texte :

orig (original form) contains a reading which is marked as following the original, rather than being normalized or corrected.
reg (regularization) contains a reading which has been regularized or normalized in some sense.

Par exemple, considérons cette fable de La Fontaine, qui, dans sa première version imprimée, commence ainsi:

Maistre Corbeau sur un arbre perché
Tenoit en son bec un fromage.
Maistre Renard par l'odeur alléché
Luy tint à peu près ce langage:
Et bonjour, Monsieur du Corbeau :
Que vous estes joly! que vous me semblez beau:

Un éditeur moderne pourrait vouloir apporter un nombre de modifications, notamment pour moderniser (ou normaliser) l’orthographe. L’encodage serait alors le suivant :

<l>
<reg>Maître</reg> Corbeau sur un arbre
perché,
</l>
<l>
<reg>Tenait</reg> en son bec un fromage.
</l>
<l>
<reg>Maître</reg>
Renard par l'odeur alléché
</l>
<l>
<reg>Lui</reg> tint à peu près ce langage:
</l>
<l>Et
bonjour, Monsieur du Corbeau</l>
<l>Que vous <reg>êtes</reg>
<reg>joli</reg>! que vous me semblez beau!</l>

Un éditeur plus conservateur, ou très soucieux des sources, voudrait sans doute s’en tenir à l’original, mais, dans le même temps, signaler que certaines lectures peuvent être anomales.

<l>
<orig>Maistre</orig>
Corbeau sur un arbre perché,
</l>
<l>
<orig>Tenoit</orig> en son bec un fromage.
</l>...

Finalement, un éditeur numérique moderne peut décider de combiner ces deux possibilités dans un seul texte composite, en utilisant l’élément choice.

choice groups a number of alternative encodings for the same point in a text.

Celui-ci permet à l’éditeur d’indiquer que des lectures alternatives sont possibles

<l>
<choice>
  <orig>Maistre</orig>
  <reg>Maître</reg>
</choice>Corbeau sur un arbre
perché,
</l>
<l>
<choice>
  <orig>Tenoit</orig>
  <reg>Tenait</reg>
</choice> en son bec un
fromage.
</l>
<l>
<choice>
  <orig>Maistre</orig>
  <reg>Maître</reg>
</choice> Renard par
l'odeur alléché
</l>
<l>
<choice>
  <orig>Luy</orig>
  <reg>Lui</reg>
</choice> tint à peu
près ce langage:
</l>
<l>Et bonjour, Monsieur du Corbeau :</l>
<l>Que vous
<choice>
  <orig>estes</orig>
  <reg>êtes</reg>
</choice>
<choice>
  <orig>joly</orig>
  <reg>joli</reg>
</choice> ! que vous me semblez
beau!</l>

9.2 Les omissions, les suppressions et les ajouts

Outre la correction et la standardisation des mots et des expressions, les éditeurs ou les transcripteurs peuvent également indiquer du matériel manquant, omettre du matériel ou transcrire du matériel qui a été supprimé ou rayé. Des matériaux peuvent également se révéler particulièrement difficiles à transcrire, par exemple quand ils sont difficilement déchiffrables. Pour ces différents cas, on utilise les éléments suivants :

add (addition) contains letters, words, or phrases inserted in the source text by an author, scribe, annotator, or corrector.
gap (gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible.
del (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, annotator, or corrector.
unclear contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source.

On utilise ces éléments pour marquer les changements opérés par l’éditeur, le transcripteur, l’auteur ou le copiste (pour ces deux derniers, dans le manuscrit même). Par exemple, soit la source suivante :

Ces éléments servent à indiquer les changements opérés par par un éditeur

On voudrait corriger l’erreur manifeste, mais dans le même temps, conserver la trace de la suppression du deuxième « par » :

Ces
éléments servent à indiquer les changements opérés par <del resp="#LDB">par</del> un
éditeur.

La valeur #LDB de l’attribut resp est alors utilisée pour pointer sur l’identification de ceux qui sont responsables de la correction (typiquement dans un élément respStmt).

Soit la source suivante :

Ces éléments servent à indiquer changements opérés par un éditeur.

L'article a été supprimé par inadvertance. Le texte corrigé pourrait alors être encodé de la manière suivante :

Ces éléments servent à indiquer <add resp="#LB">les</add> changements opérés par un éditeur.

Ces éléments ne se limitent pas à indiquer les changements opérés par un éditeur. Ils peuvent être aussi utilisés pour conserver des changements opérés par l’auteur. Un manuscrit dans lequel un auteur a d’abord écrit « How it galls me, what a galling shadow », puis remplacé le mot « galls » par « dogs », pourrait être encodé comme suit :

How it <del hand="#DHL" type="overstrike">galls</del>
<add hand="#DHL" place="supralinear">dogs</add> me, what a galling shadow

A nouveau, le code #DHL pointe sur un autre endroit du texte, où l’on dispose de plus d’informations. De manière similaire, on utilise les éléments unclear et gap pour indiquer l’omission d’un matériel illisible ; l’exemple suivant montre également l’utilisation de add pour un ajout fait d'une autre main :

<add hand="#EPR" place="inspace">Envoyez-moi une épreuve <unclear cert="medium">W</unclear>
<gap reason="inDéchiffrable"/>
</add>

L’élément del indique que le matériel transcrit est, sur l’original, marqué comme supprimé, tandis que l’élément gap indique l’endroit où se trouve le matériel omis, que cette décision soit considérée comme opportune ou pas. Un corpus linguistique, par exemple, pourrait omettre les longues citations en langue étrangère, ou bien omettre systématiquement les figures, et les formules mathématiques. Voici un exemple¹² :

<p>Dans de tels
puzzles les pièces se divisent en quelques grandes classes dont les plus connues
sont : les bonshommes <gap>
  <desc>figure représentant plusieurs pièces de puzzle de
     ce type</desc>
</gap>; les croix de Lorraine <gap>
  <desc>figure représentant
     plusieurs pièces de puzzle de ce type</desc>
</gap>; et les croix
<gap>
  <desc>figure représentant plusieurs pièces de puzzle de ce
     type</desc>
</gap>
</p>

9.3 Les abréviations et leurs formes développées

Tout comme les noms, les dates et les nombres, les abréviations peuvent être transcrites telles qu’elles sont écrites ou bien développées. Elles peuvent rester non marquées, ou bien encodées en utilisant les éléments suivants :

abbr (abbreviation) contains an abbreviation of any sort.
expan (expansion) contains the expansion of an abbreviation.

L’élément abbr est utile pour distinguer des items semi-lexicaux tels que des acronymes, ou du jargon. Voici un exemple¹³ :

<head>Lutte contre le nepotisme : le
<abbr>PS</abbr> entrave
<abbr>EE-LV</abbr>
</head>

On utilise l’attribut type pour distinguer les types d’abréviation en fonction de leur rôle. On utilise l’élément expan pour indiquer que la forme développée de l’expression a été fournie par l’encodeur. Cet élément est particulièrement utile dans la transcription de manuscrits. Par exemple, dans les manuscrits de l’époque médiévale, on trouve très couramment le caractère p avec une hampe barrée, qui est la représentation conventionnelle du mot « per ». Un encodeur peut choisit de le développer de la manière suivante :

La forme développée de l’abréviation ne contient pas toujours les mêmes lettres que celles qui ont été utilisées dans l’abréviation. Quand cela arrive, cependant, la pratique éditoriale courante est d’utiliser les italiques, ou sinon de signaler quelles lettres ont été ajoutées. L’élément expan ne doit pas être utilisé dans ce cas, car sa fonction est d’indiquer la forme développée et non une partie de celle-ci. Par exemple, considérons l’abréviation c (pour cum), que l’on trouve souvent dans les textes médiévaux. Dans une édition moderne, un éditeur pourrait vouloir représenter ceci par ‘cum’, en mettant en italiques les lettres ajoutées. Un moyen simple d’y arriver serait de l’encoder comme suit :

Pour enregistrer à la fois une abréviation et sa forme développée, on peut utiliser l’élément choice, mentionné plus haut, pour rassembler la forme abrégée et la forme développée :

10 Les noms, les dates, et les nombres

Le schéma TEI permet de distinguer et d'identifier toutes sortes d'entités nommées, comme les noms de personnes et les noms de lieux, de marques, d'organisations, mais aussi des nombres ou des dates, autant d'informations précises qui produisent ou accrochent le texte à un contexte. Ces infomations factuelles rendent le text utile à de nombreuses disciplines, par exemple l'histoire ou la sociologie. Elles interessent aussi les applications linguistiques, en permettant par exemple d'isoler tous les mots qui ne figurent pas dans un dictionnaire de langue. Les éléments décrits ici, en rendant ces caractéristiques explicites, réduisent la complexité des traitements automatiques des textes.

10.1 Les dénominations et les expressions référentielles

Une expression référentielle est une expression qui réfère à une personne, un lieu, un objet. Deux éléments permettent de caractériser une telle expression :

rs (referencing string) contains a general purpose name or referring string.
name (name, proper noun) contains a proper noun or noun phrase.

On utilise l’attribut type pour distinguer les noms de personnes, de lieux, d’organisations, quand cela est possible. Voici un exemple¹⁴ :

Le premier dîner que <rs type="person">M. de
Norpois</rs> fit à la maison, une année où je jouais encore aux <rs type="place">Champs-Élysées</rs>, est resté dans ma mémoire, parce que l’après-midi de ce même
jour fut celui où j’allai enfin entendre <rs type="person">la Berma</rs>, en « matinée
», dans
<title>Phèdre</title>

Dans ce deuxieme exemple¹⁵, on traite le nom d'un ministère, pareillement aux autres agences, comme "organization" :

Comme le <rs type="organization">ministère
des Circonlocutions</rs> avait pour principe de ne jamais donner une réponse
catégorique, sous quelque prétexte que ce fût, <rs type="person">M. Mollusque</rs> se
contenta de répondre :
<q>C’est possible.</q>

Comme l’exemple suivant le montre, l’élément rs peut être utilisé pour faire référence à une personne, un lieu, etc., qui n’est pas nécessairement exprimé sous la forme d’un nom propre ou d’un groupe nominal.

<rs type="person">M. de Norpois</rs> avait
changé, sur un point bien plus important pour moi, les intentions de <rs type="person">mon père</rs>. ...

L’élément name par contraste est utilisé avec des noms propres. Il est alors synonyme de l’élément rs, ou enchâssé dans cet élément si l’expression référentielle comporte un mixte de noms propres et de noms communs (par exemple « Dédé l’embrouille »).

Etiqueter simplement une expression comme une dénomination permet rarement de traiter automatiquement les noms de personnes dans les formes habituellement attendues par les cadres d’analyse qui s’intéressent aux expressions référentielles. Le nom tel qu’il apparaît dans le texte peut être orthographié de manière erronée, partielle ou vague. En outre, il peut y avoir des particules telles que « van », « de la », etc., qui peuvent ou non être traitées comme faisant partie du nom, et qui dépendent de la langue et du pays d’origine du porteur.

L’attribut key propose un identifiant alternatif standard pour les objets nommés, analogue à une clé dans une base de données. C’est alors un moyen pratique pour rassembler toutes les références dispersées dans le texte, qui renvoient au même individu, au même lieu :

Le premier
dîner que <rs type="person" key="NORP1">M. de Norpois</rs> fit à la maison, une année
où je jouais encore aux <rs type="place" key="CE">Champs-Élysées</rs>, est resté dans
ma mémoire, parce que l’après-midi de ce même jour fut celui où j’allai enfin entendre
<rs type="person" key="LABM">la Berma</rs>, en « matinée », dans

<title>Phèdre</title>

Cette utilisation doit être distinguée de celle qui fait usage de l’élément reg (regularization - régularisation), qui permet de marquer la forme normalisée d’une expression référentielle :

<name type="person" key="DANT1">
<choice>
<reg>Georges Jacques Danton</reg>
<sic>Danton</sic>
</choice>
</name> est
né le 26 octobre 1759 à <name key="AsA"> Arcis-sur-Aube</name> et mort le 5 avril 1794
(16 germinal an II) à Paris.

L’élément index, discuté dans la section indexation, peut être plus approprié si le rôle de la régularisation est de fournir un index cohérent :

<p>Emmanuel Le Roy Ladurie
s'appuie sur les registres d'inquisition de <name type="person">Jacques
   Fournier</name>
<index>
  <term>Benoit XII, Pape d'Avignon (Jacques
     Fournier)</term>
</index> afin de retracer la vie des habitants de Montaillou en
Haute-Ariège « infesté » par le catharisme. </p>

Bien qu’elles soient pertinentes pour des applications simples, ces méthodes présentent deux inconvénients : il faut répéter la régularisation pour chaque occurrence du même nom ; et le poids de l’encodage supplémentaire XML dans le corps du texte peut devenir difficile à maintenir et compliqué à traiter. En onomastique ou en histoire par exemple, on est concerné par les personnes ou les lieux nommés, plutôt que par les noms eux-mêmes. En ce cas ou bien si l’on souhaite faire une analyse détaillée des constituants du nom, on se reportera aux Recommandations, qui fournissent un large éventail de solutions.

10.2 Les dates et les heures

Voici des balises permettant d’encoder les heures et les dates de manière plus précise :

date contains a date in any format.
time contains a phrase defining a time of day in any format.

Ces éléments ont un nombre d’attributs, dont les valeurs sont exprimées selon des formats standardisés.

att.datable provides attributes for normalization of elements that contain dates, times, or datable events.

calendar	indicates the system or calendar to which the date represented by the content of this element belongs.
period	supplies a pointer to some location defining a named period of time within which the datable item is understood to have occurred.
when	supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.

L’attribut when sert à spécifier la forme de la date ou de l’heure ; il repose sur l’un des formats normalisés ISO 8601. Des dates partielles ou des heures (par exemple, « 1990 », « septembre 1990 », « douzaine ») peuvent être représentées en omettant une partie de la valeur fournie :

<date when="1980-02-21">21 fevrier
1980</date>
<date when="1990">1990</date>
<date when="1990-09">septembre
mcmxc</date>
<date when="--09">septembre</date>
<date when="2001-09-11T12:48:00">11
septembre, neuf heures moins douze GMT</date>

Notons, dans le dernier exemple, l’utilisation d’une représentation normalisée pour une date, qui inclut une heure : cet exemple aurait pu aussi être encodé en utilisant l’élément time.

Décret de la
Convention <date when="1794-10-30">9 Brumaire An III</date>

<l>specially when it's nine below zero</l>
<l>and <time when="15:00:00">three o'clock in the afternoon</time>
</l>

10.3 Les nombres

Les nombres peuvent être écrits en lettres ou en chiffres (vingt et un, xxi et 21), et leur représentation dépend de la langue (par exemple, en anglais 5th est équivalent à 5 en grec ; 123,456.78 en anglais est équivalent à 123.456,78 ou 123 456,78 en français). Dans des applications TAL, il est souvent utile de les distinguer des parties plus proprement « lexicales » du texte. Dans d’autres applications, la possibilité d’enregistrer la valeur d’un nombre dans un format normalisé est importante. L’élément num le permet :

num (number) contains a number, written in any form.

Par exemple:

<num value="33">xxxiii</num>
<num type="cardinal" value="21">vingt et un</num>
<num type="percentage" value="10">dix pourcent</num>
<num type="percentage" value="10">10%</num>
<num type="ordinal" value="5">5eme</num>

11 Les listes

L’élément list sert à baliser n’importe quelle sorte de liste. Une liste est un glossaire ou bien une séquence d’items textuels, qui peuvent être numérotées ou non. Chaque item peut être précédé par une étiquette (dans un glossaire, cette étiquette est le terme qui est défini).

Les items d’une liste sont balisés par l’élément item. Le premier item peut être précédé de head, qui indique qu’il est le premier élément de la liste. La numérotation de la liste peut être omise, ou bien spécifiée grâce à l’attribut n, que l’on associe à chaque item, ou bien encore (plus rarement) en utilisant l’élément label. Les encodages suivants sont donc tous équivalents :

<list>
<head>Une liste courte</head>
<item>Premier item de la liste.</item>
<item>Deuxième item de la liste.</item>
<item>Troisième item de la liste.</item>
</list>
<list>
<head>Une liste courte</head>
<item n="1">Premier item de la liste.</item>
<item n="2">Deuxième item de la liste.</item>
<item n="3">Troisième item de la liste.</item>
</list>
<list>
<head>Une liste courte</head>
<label>1</label>
<item>Premier item de la liste.</item>
<label>2</label>
<item>Deuxième item de la liste.</item>
<label>3</label>
<item>Troisième item de la liste.</item>
</list>

On ne doit pas mélanger les styles dans une même liste.

Un simple tableau à deux colonnes peut traiter la liste de type glossaire. Elle sera balisée avec l’élément <list type="gloss">. Chaque item, ici, comprend un terme et une glose, qui sont balisés respectivement par label et item. Ils correspondent aux éléments term et gloss, qui peuvent apparaître dans n’importe quel texte en prose. Voici un exemple du Petit glossaire pour servir à l'intelligence des auteurs décadents et symbolistes (1888) de Jacques Plowert :

<list type="gloss">
<label>ENGOULER</label>
<item>
  <p>v. a. — Avaler d’une manière goulue.</p>
  <cit>
   <q>le vitrail jaune des portes de brasseries, tantôt vomissant, tantôt
       engoulant des masses noires.</q>
   <bibl>Les Demoiselles Goubert.</bibl>
  </cit>
</item>
<label>ENGRANDEUILLER(s’)</label>
<item>
  <p>v. p. - Se mettre en grand deuil. </p>
  <cit>
   <q>L'automne s’engrandeuille au bois de Boulogne.</q>
   <bibl>
    <title>Complaintes.</title> Jules Laforgue. </bibl>
  </cit>
</item>
<label>ENJAUNI.</label>
<item>
  <p>Adj. — Rendu jaune.</p>
  <cit>
   <q>La bâtisse de l'Opéra aux baies enjaunies de lumière où des ombres se
       heurtent. </q>
   <bibl>
    <title>Thé chez Miranda.</title> Paul Adam. </bibl>
  </cit>
</item>
<label>ENVOL</label>
<item>
  <p>s. M. — Vol elliptique.</p>
  <cit>
   <q>Un envol de pigeons écarlates tonne autour de ma pensée. </q>
   <bibl>
    <title>Illuminations.</title> A. Rimbaud.</bibl>
  </cit>
</item>
</list>

Notons ici que le contenu des éléments item peut être assez riche : dans le cas précèdant, on s'est servi de l'élément cit pour fournir l'exemple cité pour chaque définition. Si la structure interne d’un item dans une liste est très complexe, il pourrait être préférable de considérer la liste comme un tableau, pour laquelle un encodage spécifique peut être défini. Voir ci-dessous 13 Les tableaux.

Des listes de toutes sortes peuvent, bien sûr, être enchâssées dans d’autres listes à n’importe quel niveau de profondeur. Ici, par exemple, une liste des vins¹⁶ est organisée à plusieurs niveaux:

<list>
<item>vins d'Alsace <list>
   <item>Riesling</item>
   <item>Traminer</item>
   <item>Pinot noir</item>
   <item>Tokay</item>
  </list>
</item>
<item>Bordeaux rouge <list>
   <item>Médoc : <list>
     <item>Château-de-1'Abbaye-Skinner</item>
     <item> Château-Lynch-Bages</item>
     <item>Château-Palmer</item>
     <item>Château-Brane-Cantenac</item>
     <item> Château-Gruau-Larose</item>
    </list>
   </item>
  </list>
</item>
<item> Graves : <list>
   <item>Château-La-Garde-Martillac</item>
   <item>Château-Larrivet-Haut-Brion; </item>
  </list>
</item>
<item> Saint-Emilion : <list>
   <item>Château-La-Tour-Beau-Site</item>
   <item>Château-Canon</item>
   <item> Château-La-Gaffeière</item>
   <item>Château-Trottevieille </item>
  </list>
</item>
<item>Pomerol : <list>
   <item>Château-Taillefer</item>
  </list>
</item>
<item>Bordeaux blanc <list>
   <item>: Sauternes : <list>
     <item>Château-Sigalas-Rabaud</item>
     <item>Château-Caillou</item>
     <item>Château-Nairac </item>
    </list>
   </item>
  </list>
</item>
</list>

Une liste peut ne pas être présentée sous forme de liste. Par exemple ¹⁷ :

<p>Dans les pages lointaines de ce livre, il
est écrit que les animaux se divisent en <list rend="run-on">
  <item n="a">appartenant à l'Empereur,</item>
  <item n="b">embaumés, </item>
  <item n="c">apprivoisés, </item>
  <item n="d">cochons de lait, </item>
  <item n="e">sirènes, </item>
  <item n="f">fabuleux, </item>
  <item n="g">chiens en liberté, </item>
  <item n="h">inclus dans la présente classification, </item>
  <item n="i">qui s'agit comme des fous, </item>
  <item n="j">innombrables, </item>
  <item n="k">dessinés avec un très fin pinceau, </item>
  <item n="l">et caetera, </item>
  <item n="m">qui viennent de casser la cruche, </item>
  <item n="n">qui de loin semblent des mouches.</item>
</list>
</p>

Les listes de références bibliographiques doivent être balisées par l’élément listBibl, qui est décrit dans la section suivante.

12 Les références bibliographiques

Il est souvent utile de distinguer les références bibliographiques qui apparaissent dans les textes transcrits à des fins de recherche, ne serait-ce que pour qu’elles soient proprement formatées au moment de l’impression. L’élément bibl sert à cela. Lorsque les composants d’une référence bibliographique doivent être distingués, on utilise les éléments présentés ci-dessous. Souvent, il est utile de marquer certaines parties, telles que le titre des articles, des livres et des revues, car elles requièrent un formatage spécifique. Les autres éléments proposés s’adressent avant tout aux projets qui portent un intérêt tout particulier au matériel bibliographique.

bibl (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.
author in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority.
biblScope (scope of bibliographic reference) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work.
date contains a date in any format.
editor contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc.
publisher provides the name of the organization responsible for the publication or distribution of a bibliographic item.
pubPlace (publication place) contains the name of the place where a bibliographic item was published.
title contains a title for any kind of work.

Par exemple, la note éditoriale à la fin de cette phrase¹⁸ ‘En février 1989, il ouvre la session publique organisée par le Parlement européen au sujet des violations des droits de l'Homme commises par le régime communiste roumain (François Fetjö, Ewa Kulesza-Mietkowski, La Fin des démocraties populaires, Seuil, 1992, page 319). pourrait être transcrite de la manière suivante :

... violations des droits de l'Homme
commises par le régime communiste roumain (<bibl>
<author>François Fetjö</author>,
<author>Ewa Kulesza-Mietkowski</author>, <title>La Fin des démocraties
populaires</title>, <publisher>Seuil</publisher>, <date>1992</date>,
<biblScope>page 319</biblScope>
</bibl>).

’

La référence bibliographique peutt également être séparée du texte, par exemple quand elle figure dans une note en bas de page, ou dans une liste de références balisée avec l’élément listBibl ; dans ce cas, le renvoi à la référence dans le texte sera balisée avec un ref ou un ptr.

13 Les tableaux

Les tableaux constituent un défi à tous les systèmes de traitement automatique, mais les tableaux simples sont du moins si fréquents dans les textes que, même dans l’ensemble simplifié qui est présenté ici, le balisage des tableaux est nécessaire. Les éléments suivants apportent des solutions :

table contains text displayed in tabular form, in rows and columns.
row contains one row of a table.
cell contains one cell of a table.

Prenons ce tableau indiquant les pentes moyennes de plusieurs aqueducs antiques, retrouvé dans Les aqueducs antiques de Germain de Montauzan (1908). Notons l'usage de l'attribut role pour indiquer que toute cellule du premier rang contient un label, et non pas une donnée.

<p>Voici quelques exemples de pentes moyennes observées sur des aqueducs antiques, en
France, autres que ceux de Lyon : <table>
  <row role="label">
   <cell>Aqueduc</cell>
   <cell>pente par kil.</cell>
   <cell>pente par m.</cell>
  </row>
  <row>
   <cell>de Vienne </cell>
   <cell>1m,16 </cell>
   <cell>0,0016 </cell>
  </row>
  <row>
   <cell>d’Arcueil</cell>
   <cell>1m,00 </cell>
   <cell>0,0010 </cell>
  </row>
  <row>
   <cell>d’Evreux</cell>
   <cell>0m,834</cell>
   <cell>0,0008 </cell>
  </row>
  <row>
   <cell>de Rodez</cell>
   <cell>0m,50</cell>
   <cell>0,0005 </cell>
  </row>
  <row>
   <cell>d’Antibes</cell>
   <cell>0m,41</cell>
   <cell> 0,0004 </cell>
  </row>
  <row>
   <cell>de Nîmes</cell>
   <cell>0m,342 </cell>
   <cell>0,0003 </cell>
  </row>
  <row>
   <cell>(minimum)</cell>
   <cell>0m,071</cell>
   <cell>0,00007 </cell>
  </row>
</table>
</p>

14 Les figures et les graphiques

Tous les composants d’un document ne sont pas forcément de nature textuelle. Le texte même le plus simple contient souvent des diagrammes et des illustrations, sans parler des textes où image et texte sont inextricablement imbriqués, ou des ressources électroniques dans lesquelles les deux sont complémentaires.

L’encodeur peut simplement noter la présence d’un graphique, et éventuellement décrire brièvement son contenu. Il peut également associer un lien vers la version numérique du graphique, en utilisant les éléments suivants :

graphic indicates the location of an inline graphic, illustration, or figure.
figure groups elements representing or containing graphic information such as an illustration, formula, or figure.
figDesc (description of figure) contains a brief prose description of the appearance or content of a graphic figure, for use when documenting an image without displaying it.

Toute information textuelle accompagnant le graphique, telle que le titre ou la légende, peut être incluse dans l’élément figure lui-même, qui comporte un élément head et un ou plusieurs éléments p. On peut inclure également le texte qui est dans le graphique. Il est fortement recommandé que l’élément figDesc comporte une description de l’image, pour que cela soit utilisable par des applications qui ne sont pas capables de traiter les graphiques, et pour rendre le graphique accessible à des lecteurs ayant un handicap visuel. (Normalement, ces descriptions ne sont pas considérées comme faisant partie du document).

La manière la plus simple de procéder est d’identifier la position d’un graphique et de l’associer à un lien, comme dans l’exemple suivant :

On a indiqué ici que le graphique est le fichier p412fig.png et qu’il apparaît entre les pages 412 et 413. L’élément graphic peut être utilisé partout où il y a un contenu textuel, i.e. dans un paragraphe, un titre, mais pas entre eux. Dans l’exemple suivant, l’encodeur a décidé de traiter les ornements d’une édition particulière comme un titre :

Plus couramment, un graphique a au moins un titre, qui peut être encodé grâce à l’élément head. On peut également rassembler un certain nombre de figures dans une structure particulière. Et c’est souvent plus facile d’inclure une courte description de l’image. L’élément figure permet de rassembler un ou plusieurs éléments de cette sorte, ils forment alors un type de graphique appelé ‘block’.

<figure>
<graphic url="chap3fig2.png"/>
<head>Dick Kennedy </head>
<figDesc>Gravure de E. Riou représentant un jeune homme assis sur une chaise, les
pieds sur une autre et tenant à la main une canne. En arrière plan, une théière, et
l'inscription <q>Map of Africa</q> .</figDesc>
</figure>

Ces cas doivent être soigneusement distingués du cas où un texte encodé est associé à une collection d’images numériques, qui constitue une ressource distincte. L’attribut facs permet d’associer n’importe quel élément du texte encodé à un fac-similé numérique. Dans le cas simple, quand seules des images sont disponibles, on utilise l’attribut facs dans l’élément pb pour associer chaque image à l’endroit du texte qui convient.

Cette méthode est valable uniquement dans les cas simples, i.e. quand chaque image numérique est contenue dans un fichier (page1.png, etc.) et correspond à une seule page transcrite et encodée. Si un alignement plus précis de l’image et de la transcription est nécessaire, par exemple, parce que les fichiers images représentent une double page, des mécanismes plus sophistiqués sont requis. On se reportera aux Recommandations.

15 L’interprétation et l’analyse

On dit souvent que tout balisage est une forme d’interprétation ou d’analyse. S’il est certainement difficile, et parfois impossible, de distinguer clairement entre des informations ‘objectives’ et des informations ‘subjectives’, et ce de manière universelle, il s’avère cependant que les jugements portant sur les secondes sont typiquement considérés comme beaucoup plus susceptibles d’alimenter des controverses que ceux portés sur les premières. Aussi, beaucoup de chercheurs notent de telles interprétations uniquement s’il est possible d’avertir le lecteur qu’elles font plus débat que d’autres parties du texte balisé. Cette section décrit quelques-uns des éléments fournis par le schéma TEI pour traiter ces différents besoins.

15.1 Les phrases « orthographiques »

Typiquement, l’interprétation parcourt l’ensemble du texte, sans respecter particulièrement telle ou telle unité structurelle. Une première étape utile consiste à segmenter le texte en unités discrètes et identifiables, chacune d’entre elles portant une étiquette servant en quelque sorte de ‘référence canonique’. Pour faciliter leur utilisation, ces unités ne doivent pas se croiser ou être emboîtées les unes dans les autres. On peut les représenter facilement en utilisant l’élément suivant :

s (s-unit) contains a sentence-like division of a text.

Comme son nom l’indique, l’élément s est l’élément le plus couramment utilisé (dans les applications linguistiques du moins) pour indiquer les phrases « orthographiques », i.e. définies par des traits orthographiques, tels que la ponctuation. Par exemple, le début du passage de Jane Eyre, présenté plus haut, peut être divisé en de telles unités :

<pb n="474"/>
<div type="chapter" n="38">
<p>
  <s n="001">J’ai enfin épousé M. Rochester.</s>
  <s n="002">Notre mariage se fit
     sans bruit; </s>
  <s n="003">lui, moi, le ministre et le clerc, étions seuls présents.</s>
  <s n="004">Quand nous revînmes de l’église, j’entrai dans la cuisine, où Marie
     pré- parait le dîner, tandis que John nettoyait les couteaux.</s>
</p>
<p>
  <s n="003"> «Marie, dis-je, j’ai été mariée ce matin à M.
     Rochester. »</s>
</p>
</div>

Notons que les éléments s ne peuvent être emboîtés : le début d’un s implique que le précédent est terminé. Quand des unités de type s sont balisées comme ci-dessus, il est conseillé de baliser la totalité du texte, de telle sorte que chaque mot du texte analysé appartiendra à un seul élément s, dont l’identifiant peut alors servir à spécifier les référence uniques à chacun de ces mots. Si les identifiants sont uniques dans le document, alors il est préférable d’utiliser l’attribut xml:id plutôt que l’attribut n (cf. exemple ci-dessus).

15.2 Les mots et la ponctuation

La segmentation du texte en mots lexicaux et non lexicaux est une opération très courante dans tous les types d’analyse textuelle, mais pas complètement triviale. Par exemple, la décision de traiter can’t en anglais ou du en français comme un ou deux mots n’est pas simple. En conséquence, il est souvent utile de rendre explicite la segmentation souhaitée. On utilise alors les éléments suivants :

w (word) represents a grammatical (not necessarily orthographic) word.
pc (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark.

Par exemple, la sortie d’un catégoriseur pourrait être encodé en TEI Lite comme suit :

<s>
<w ana="#NAM">TreeTagger</w>
<w ana="#VER_pres">permet</w>
<w ana="#PRP">d'</w>
<w ana="#VER_infi">annoter</w>
<w ana="#PRO_IND">plusieurs</w>
<w ana="#NOM">langues</w>
<pc>. </pc>
</s>

Dans cet exemple, chaque mot a été pourvu automatiquement d’une catégorie, en utilisant l’attribut ana (cf. section ci-dessus). L’élément w permet aussi de noter pour chaque mot sa racine ou son lemme, soit de manière explicite, en utilisant l’attribut lemma, soit par référence, en utilisant l’attribut lemmaRef, comme dans l’exemple qui suit :

...<w
  ana="#VER_pres"
  lemma="permettre"
  lemmaRef="http://www.myLexicon.com/permettre">permet</w> ...

15.3 Les éléments généraux d’interprétation

L’élément w est une spécialisation de l’élément seg, dont l’utilisation a déjà été présentée pour identifier des cibles non marquées de références croisées et des hyperliens (cf. section 8 Les références croisées et les liens) ; il spécifie une certaine portion de texte, à laquelle l’encodeur peut assigner un type utilisateur particulier, ou un identifiant unique ; il peut alors être utilisé pour marquer des caractéristiques textuelles, qui n’ont pas été prévues dans les Recommandations.

Par exemple, les Recommandations ne proposent pas d’élément ‘apostrophe’ pour marquer les parties d’un texte littéraire dans lesquelles le narrateur s’adresse directement au lecteur (ou à l’auditeur). Un traitement possible serait de considérer ces apostrophes comme des instances de l’élément q, que l’on distinguerait des autres en choisissant une valeur appropriée pour l’attribut who. Une autre solution, plus simple, et certainement plus générale, consiste à utiliser l’élément seg comme suit :

<l>Tu le connais, lecteur, ce monstre délicat,</l>
<l>
<seg type="apostrophe">— Hypocrite lecteur, — mon semblable, — mon frère!</seg>
</l>

L’attribut type de l’élément seg peut prendre n’importe quelle valeur, et peut donc servir à marquer des phénomènes de toutes sortes ; il est de bonne pratique que d’enregistrer les valeurs utilisées et leur signification dans l’en-tête.

Un élément seg d’un type donné (à la différence de l’élément s, qui lui ressemble superficiellement) peut être enchâssé dans un élément seg de même type ou de type différent. Cela permet de représenter des structures assez complexes ; des exemples ont été proposés plus haut (8.3 Les types spéciaux de liens). Cependant, comme les éléments doivent être correctement enchâssés et ne pas se chevaucher, seg ne peut pas gérer l’association d’une interprétation avec des segments arbitraires du texte, qui ignoreraient complètement la hiérarchie du document. Il nécessite également que l’interprétation elle-même soit représentée par une valeur unique codée dans l’attribut type.

Aucune de ces contraintes ne s’applique à l’élément interp, qui offre des caractéristiques puissantes pour encoder de manière relativement directe une interprétation assez complexe.

interp (interpretation) summarizes a specific interpretative annotation which can be linked to a span of text.
interpGrp (interpretation group) collects together a set of related interpretations which share responsibility or type.

Ces éléments permettent à l’encodeur de spécifier à la fois la classe de l’interprétation et une interprétation spécifique, relevant de cette classe, et que l’interprétation implique. De ce fait, tandis qu’avec seg, on peut dire simplement que quelque chose est une apostrophe, avec interp, on peut dire qu’il s’agit d’une instance (d’une apostrophe dans l’exemple) d’une classe plus grande (celle des figures de rhétorique).

En outre, interp est un élément vide, qui doit être lié au passage auquel il s’applique, au moyen de l’attribut ana, présenté ci-dessus (8.3 Les types spéciaux de liens), ou bien en utilisant son propre attribut inst. Cela signifie que n’importe quel type d’analyse peut être représenté, sans que l’on ait besoin de respecter la hiérarchie du document. Le regroupement des analyses de même type est également aisé, et ce grâce à l’élément spécifique <interGrp>.

Par exemple, supposons que l’on souhaite marquer différents aspects d’un texte, tels que des thèmes, des figures de rhétorique, et des emplacements de scènes. Différentes parties de notre extrait de Jane Eyre, par exemple, pourraient être ainsi associées.

Ces interprétations pourraient être placées n’importe où dans une partie identifiée par l’élément text. Cependant, il est de bonne pratique de les noter au même endroit (par exemple, dans une section séparée des Parties liminaires et des Annexes), comme dans l’exemple suivant :

<back>
<div type="Interpretations">
  <p>
   <interp xml:id="set-church-1" resp="#LB-MSM" type="emplacement">église</interp>
   <interp xml:id="set-kitch-1" resp="#LB-MSM" type="emplacement">cuisine</interp>
   <interp xml:id="set-unspec-1" resp="#LB-MSM" type="emplacement">ailleurs</interp>
  </p>
</div>
</back>

La redondance manifeste de cet encodage peut être largement réduite en utilisant l’élément <interGrp> pour regrouper tous les éléments interp qui ont des valeurs d’attribut identiques :

<back>
<div type="Interpretations">
  <interpGrp type="emplacement" resp="#LB-MSM">
   <interp xml:id="set-church">église</interp>
   <interp xml:id="set-kitch">cuisine</interp>
   <interp xml:id="set-unspec">ailleurs</interp>
  </interpGrp>

</div>
</back>

Une fois ces éléments d’interprétation définis, ils peuvent être liés aux parties auxquelles ils s’appliquent, et ce de deux manières différentes : ils peuvent servir comme cibles de l’attribut ana attaché aux instances, ou bien ils peuvent eux-mêmes pointer sur les instances. Par exemple, en se servant de l'attribut global ana, on pointe du texte vers son analyse :

<div type="chapitre" n="38">
<p xml:id="P38.1" ana="#set-church #set-kitch"> J’ai enfin épousé M. Rochester.
   Notre mariage se fit sans bruit; lui, moi, le ministre et le clerc, étions seuls
   présents. Quand nous revînmes de l’église, j’entrai dans la cuisine, où Marie pré-
   parait le dîner, tandis que John néttoyait les couteaux.</p>
</div>

Notons, dans cet exemple, que, comme le paragraphe comporte deux lieux (l’église et la cuisine), les deux identifiants ont été fournis.

Ou, pour pointer dans l'autre sens, de l'analyse vers le texte, on peut utiliser l’attribut inst de l'éléments interp ainsi :

<interpGrp type="emplacement" resp="#LB-MSM">
<interp inst="#P38.1">église</interp>
<interp inst="#P38.1" resp="#LB-MSM">cuisine</interp>
</interpGrp>

L’élément interp n’est pas limité à un certain type d’analyse. L’analyse littéraire ci-dessus en est une parmi d’autres ; on pourrait également utiliser interp pour représenter une analyse linguistique en termes de parties du discours. Par exemple, la phrase proposée dans la section 8.3 Les types spéciaux de liens suppose une analyse linguistique qui pourrait être représentée comme suit :

<interp xml:id="NP1" type="pos">noun phrase,
singular</interp>
<interp xml:id="VV1" type="pos">inflected verb, present-tense
singular</interp> ...

16 La documentation technique

Bien que l’objet central de ce document soit l’utilisation du schéma TEI pour encoder des documents existants, on peut utiliser ce même schéma pour encoder de nouveaux documents. Dans la préparation de nouveaux documents (tels que celui-ci), XML présente de nombreux avantages : la structure du document peut être représentée clairement, et le même texte électronique peut être réutilisé à des fins très différentes - pour produire par exemple des versions hypertextes ou feuilletables en ligne, mais aussi des versions bien mises en forme, et ce à partir d’une même source.

Pour ce faire, le schéma TEI Lite comprend des éléments permettant de marquer les caractéristiques de documents techniques en général, et de documents XML en particulier.

16.1 Les éléments supplémentaires pour les documents techniques

On peut utiliser les éléments suivants pour marquer les caractéristiques particulières de documents techniques :

att (attribute) contains the name of an attribute appearing within running text.
code contains literal code from some formal language such as a programming language.
eg (example) contains any kind of illustrative example.
formula contains a mathematical or other formula.
gi (element name) contains the name (generic identifier) of an element.
ident (identifier) contains an identifier or name for an object of some kind in a formal language. ident is used for tokens such as variable names, class names, type names, function names etc. in formal programming languages.
val (value) contains a single attribute value.

L’exemple suivant montre comment ces éléments pourraient être utilisés pour encoder le passage d’un tutoriel, qui constitue une introduction au langage de programmation Fortran :

<p>Par tradition, on introduit
une langue de programmation avec cet exemple classique: <eg> CHAR*12 GRTG
   GRTG='HELLO WORLD'
   PRINT *, GRTG
   END
</eg>
</p>
<p>Dans ce code, on voit la déclaration d'un variable <ident>GRTG</ident>:
<code>CHAR*12 GRTG</code>, qui indique que l'object <ident>GRTG</ident> contient
12 octets de type <ident>CHAR</ident>. La valeur <val>HELLO WORLD</val> est ensuite
attribuée à cette variable. </p>

On peut paramétrer une application pour qu’un texte, tel que celui présenté ci-dessus, soit correctement formaté (par exemple, en conservant les fins de ligne, en utilisant une police particulière). De même, l’utilisation de balises telles que ident facilite grandement la construction d’un index.

On utilise l’élément formula pour encadrer toute formule mathématique ou chimique, qui se présente dans un texte comme un élément distinct. Comme les formules sont généralement exprimées au moyen d’une grande diversité de signes typographiques spécifiques, ce qui n’est pas le cas des textes ordinaires, le corps de la formule doit faire l’objet d’une notation spécifique. Cette notation doit être spécifiée par l’attribut notation, comme dans l’exemple suivant :

<formula notation="tex"> \begin{math}E =
mc^{2}\end{math} </formula>

Se pose un problème quand l’encodage XML est le propos même du document technique, lui-même encodé en XML. Dans de tels documents, il est essentiel de distinguer clairement l’encodage des exemples de l’encodage du document lui-même ; dans ce type de document, la probabilité de trouver des balises de fin est alors très forte. Une solution simple consiste, pour les exemples, à utiliser la référence d’entité prédéfinie < pour représenter chaque caractère < (qui marque le début d’une balise XML). Une solution plus générale consiste à spécifier que les exemples sont des données, qui ne sont pas destinées à être traitées par un analyseur. Pour ce faire, on se sert d’une construction XML particulière, appelée CDATA marked section, comme dans l’exemple suivant :

<p>Une liste peut
être balisée ainsi: <eg><![ CDATA[ <list> <item>Premier item de
la liste</item> <item>deuxième item</item> </list>]]>
</eg> L'élément <gi>list</gi> contient une série d'éléments
<gi>item</gi>
</p>

L’élément list utilisé dans l’exemple ci-dessus ne doit pas être considéré comme faisant partie du document proprement dit, parce qu’il est inséré dans une section marquée : elle commence par une déclaration spéciale de balisage <![CDATA[, et se termine par ]]>.

Notons également l’utilisation de l’élément gi pour baliser les références aux noms d’éléments (ou identifiants génériques) dans le corps du texte.

16.2 Les sections générées

Les systèmes de production de documents actuelles offrent la possibilité de générer automatiquement des sections telles que la table des matières, l'index etc. Le schéma TEI Lite propose un élément pour marquer l’endroit où une telle section doit se trouver.

L’élément divGen peut être placé à n’importe quel endroit où un élément de division serait autorisé :

<front>
<titlePage>

</titlePage>
<divGen type="toc"/>
<div>
<head>Preface</head>

</div>
</front>
<body>

</body>
<back>
<div>
<head>Appendix</head>

</div>
<divGen type="index" n="Index"/>
</back>

Cet exemple montre également l’utilisation de l’attribut type pour distinguer les différentes sortes de sections qui auront été générées : dans le premier cas, une table des matières (un toc), et dans le second un index.

Quand un index existant ou une table des matières doivent être encodés (plutôt que générés), on utilise l’élément list (cf. section 11 Les listes).

16.3 La génération d’index

Alors que la production d’une table des matières à partir d’un document correctement balisé ne pose généralement pas de problèmes, la production d’un index de bonne qualité demande souvent un balisage plus minutieux. Bien que l’extraction (par exemple) de toutes les occurrences des éléments term ou name puisse constituer un bon point de départ, cela peut cependant ne pas suffire.

Le schéma TEI propose une balise spécifique index, qu’on utilise pour indiquer à la fois les parties du document qui doivent être indexées, et comment l’index doit être construit.

index (index entry) marks a location to be indexed for whatever purpose.

Par exemple, le deuxième paragraphe de cette section pourrait être codé ainsi :

Le schéma TEI propose une balise spécifique
<gi>index</gi>
<index>
<term>indexation</term>
</index>
<index>
<term>index (balise)</term>
<index>
<term> generation
d'index</term>
</index>
</index>, qu’on utilise pour indiquer ...

L’élément index peut également être utilisé pour fournir une information de nature interprétative ou analytique. Par exemple, dans le cadre d’une étude stylistique, on voudrait enregistrer les différentes figures utilisées par Ovide dans les Métamorphoses¹⁹. Une telle étude enregistrerait les références à Jupiter (soit deus, se, et le sujet de confiteor [sous une forme fléchie codée 227], les références à Jupiter-sous-l’-apparence-d’-un-taureau (soit imago tauri fallacis et le sujet de teneo), et ainsi de suite.

<l n="3.001">iamque deus posita fallacis
imagine tauri</l>
<l n="3.002">se confessus erat Dictaeaque rura tenebat</l>

Pour ce faire, on peut utiliser l’élément note, discuté dans la section 7 Les notes, ou l’élément interp discuté dans la section 15 L’interprétation et l’analyse. Ici, on montre comment l’élément index peut aussi être utilisé.

On suppose que l’objet doit générer plus d’un index : l’un rassemblera les noms des divinités (appelé dn), un autre les références onomastiques (appelé on), un troisième les références pronominales (appelé pr), et ainsi de suite. On pourrait alors procéder de la manière suivante :

<l n="3.001">iamque deus posita fallacis
imagine tauri <index indexName="dn">
  <term>Iuppiter</term>
  <index>
   <term>deus</term>
  </index>
</index>
<index indexName="on">
  <term>Iuppiter (taurus)</term>
  <index>
   <term>imago tauri
       fallacis</term>
  </index>
</index>
</l>
<l n="3.002">se confessus erat Dictaeaque
rura tenebat <index indexName="pr">
  <term>Iuppiter</term>
  <index>
   <term>se</term>
  </index>
</index>
<index indexName="v">
  <term>Iuppiter</term>
  <index>
   <term>confiteor
       (v227)</term>
  </index>
</index>
</l>

Pour chaque élément index ci-dessus, une entrée sera générée dans l’index approprié. L’entrée est le contenu de l’élément term. Dans chaque cas, les éléments term, rassemblés dans l’élément index secondaire, fournissent un deuxième mot clé. La référence à proprement parler sera extraite du contexte dans lequel l’élément index apparaît, i.e. dans ce cas, l’identifiant de l’élément l qui le contient.

16.4 Les adresses

On utilise l’élément address pour baliser une adresse postale de n’importe quel type. Il contient un ou plusieurs éléments addrLine, pour chaque ligne de l’adresse. Voici un exemple simple :

<address>
<addrLine>Centre d'Études Supérieures de la Renaissance</addrLine>
<addrLine>59, rue Néricault-Destouches</addrLine>
<addrLine> 37013 TOURS</addrLine>
<addrLine>France</addrLine>
</address>

Les parties de l’adresse peuvent être distinguées en utilisant l’élément name, discuté dans la section 10.1 Les dénominations et les expressions référentielles.

<address>
<addrLine>Centre d'Études Supérieures de la Renaissance</addrLine>
<addrLine>59, rue Néricault-Destouches</addrLine>
<addrLine> 37013 <name type="city">TOURS</name>
</addrLine>
<addrLine>
<name type="country">France</name>
</addrLine>
</address>

17 Les jeux de caractères, les diacritiques, etc.

Avec l’adoption par XML de l’Unicode, c’est-à-dire l'ensemble des caractères devant être utilisés dans tous les documents, la plupart des problèmes qui se posaient précédemment pour représenter la diversité des langues et des systèmes d’écriture ont été grandement réduits. Pour ceux qui travaillent avec les formes standards des langues européennes notamment, il n’y a presque rien à faire : tout éditeur XML permet de saisir directement des caractères accentués ou d’autres caractères « non ASCII ». Ces caractères doivent être stockés dans le fichier final de manière à ce qu’ils soient directement transférables entre des systèmes différents.

Dans un document XML, il y a deux exceptions importantes : les caractères & et < ne peuvent être saisis directement, car ils ont une signification particulière, celle de balise de début. Ils doivent toujours être représentés comme des références d’entité : & ou <.

18 Les Parties liminaires et les Annexes

18.1 Les Parties liminaires

Dans beaucoup de cas, notamment dans les textes plus anciens, les Parties liminaires, telles que les pages de titre, les messages préliminaires, etc. peuvent fournir des informations linguistiques et sociales très utiles. P5 fournit un ensemble de recommandations pour distinguer les éléments textuels les plus courants rencontrés dans les Parties liminaires. Elles sont présentées ci-dessous.

18.1.1 La page titre

Le début d’une page titre doit être balisé avec l’élément titlePage. Tout le texte contenu sur la page doit être transcrit et balisé avec les éléments figurant dans la liste suivante :

titlePage (title page) contains the title page of a text, appearing within the front or back matter.
docTitle (document title) contains the title of a document, including all its constituents, as given on a title page.
titlePart contains a subsection or division of the title of a work, as indicated on a title page.
byline contains the primary statement of responsibility given for a work on its title page or at the head or end of the work.
docAuthor (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline).
docDate (document date) contains the date of a document, as given (usually) on a title page.
docEdition (document edition) contains an edition statement as presented on a title page of a document.
docImprint (document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page.
epigraph contains a quotation, anonymous or attributed, appearing at the start or end of a section or on a title page.

Quand cela est nécessaire, les variations de polices de caractères doivent être notées avec l’attribut rend, décrit ci-dessus. Les Recommandations n’offrent pas à l’heure actuelle de solutions pour décrire en détail l’interlettrage et de la dimension des caractères dans les titres ornés. On indiquera les changements de langue au moyen de l’attribut xml:lang ou de l’élément foreign. Le nom des personnes, des lieux, des organisations peut être balisé au moyen de l’élément name, partout où ils apparaissent, et si aucun autre élément plus spécifique n’est disponible.

Deux pages titre en guise d’exemple :

<titlePage rend="Roman">
<docTitle>
<titlePart type="main">BERENICE <hi>TRAGEDIE</hi>. </titlePart>
</docTitle>
<byline>PAR <docAuthor>M. RACINE.</docAuthor>
</byline>
<docImprint>A <name>PARIS</name>, Chez <name>Claude Barbin</name>, au Palais, sur
le Second Perron de la Sainte Chapelle.</docImprint>
<docDate>M. DC. LXXI</docDate>
<imprimatur>AVEC PRIVILEGE DU ROY.</imprimatur>
</titlePage>

<titlePage>
<docTitle>
<titlePart type="main"> LES CHOSES</titlePart>
<titlePart type="sub">Une histoire des années soixante </titlePart>
</docTitle>
<byline>PAR <docAuthor>GEORGES PEREC</docAuthor>
</byline>
<titlePart>Postface de <name>Jacques LEENHARDT</name>
</titlePart>
<docImprint>Julliard</docImprint>
</titlePage>

Comme précédemment, on utilise l’attribut ref pour associer un nom à une description plus elaborée de l'entité nommé stockée ailleurs. Par exemple :

<titlePart>Postface de <name ref="http://cral.ehess.fr/index.php?139">Jacques
LEENHARDT</name>
</titlePart>

18.1.2 Différents types de liminaires

Les parties les plus importantes des Parties liminaires doivent être balisées comme div ; pour distinguer les différents liminaires, l’attribut type pourra avoir l’une des valeurs suivantes :

preface: Un avant-propos ou une préface qui s’adresse au lecteur, dans lequel l’auteur ou l’éditeur expose le contenu, les objectifs, l’origine du texte
dedication: Une adresse formelle ou une dédicace de l’auteur du texte à une ou plusieurs personnes ou institutions
abstract: Un résumé du contenu du texte, sous la forme d’un texte libre
ack: Les remerciements de l’auteur pour des personnes et des institutions qui ont contribué à la création du texte
contents: Une table des matières spécifiant la structure du travail et listant ses constituants. On doit utiliser l’élément list pour spécifier sa structure.
frontispiece: Un frontispice illustré, qui comprend éventuellement du texte.

Pour d’autres types de liminaires, l’encodeur peut spécifier à sa guise les autres valeurs de l’attribut type.

Comme dans n’importe quelle division textuelle, les Parties liminaires peuvent contenir des éléments structurels de bas niveau ou des éléments non structurels (cf. ci-dessus). Elles commencent généralement par un titre, qui doit être balisé au moyen de l’élément head. Les éléments suivants sont aussi disponibles pour baliser les contenus:

salute (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc.
signed (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text.
byline contains the primary statement of responsibility given for a work on its title page or at the head or end of the work.
dateline contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer.
argument contains a formal list or prose description of the topics addressed by a subdivision of a text.
cit (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example.
opener groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter.
closer groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter.

Les messages qui apparaissent ailleurs dans le texte, contiendront naturellement les mêmes éléments.

A titre d’exemple, prenons la préface des Essais de Michel de Montaigne (édition de 1600) ; elle pourrait être balisée comme suit :

<div type="préface">
<head>Au Lecteur</head>
<p>C'EST icy un Livre de bonne foy, Lecteur. Il t'advertit dès l'entrée, que je ne
   m'y suis propose aucune fin, que domestique & privée .... Ainsi, Lecteur, je
   suis moy-mesme la matière de mon livre : ce n'est pas raison que tu employes ton
   loisir en un subject si frivole & si vain.</p>
<closer>
  <salute>A Dieu donc.</salute>
  <signed>De <name>Montaigne</name>, ce premier de Mars, mil cinq cês quatre
     vingts.</signed>
</closer>
</div>

18.2 Les Annexes

18.2.1 Les divisions structurelles des Annexes

Du fait de la diversité des pratiques de publications, les Annexes peuvent contenir virtuellement tous les éléments listés dans les Parties liminaires ; quand c’est le cas, on utilise les mêmes éléments. Cependant, les Annexes peuvent aussi contenir les types indiqués ci-dessous. Comme pour les divisions structurelles du corps du texte, ils doivent être balisés par des éléments div, et distingués par les valeurs associées à l’attribut type :

appendix: Une section auxiliaire indépendante d’un travail, qui propose souvent un texte additionnel d’une certaine manière non canonique.
glossary: Une liste de termes associés chacun à une définition (glose) : elle doit être encodée avec l’élément <list type="gloss">.
notes: Une section dans laquelle des notes de toutes sortes, y compris textuelles, sont regroupées.
bibliogr: Une liste de références bibliographiques : elle doit être encodée comme une listBibl
index: Toute forme d’index préexistant au travail. (Un index peut aussi être généré pour un document. On utilise alors l’élément index décrit plus haut).
colophon: Une déclaration qui apparaît à la fin d’un livre décrivant les conditions de sa production physique.

19 La page titre électronique

Chaque texte TEI a un en-tête qui fournit une information analogue à celle qui est fournie dans la page titre d’un texte imprimé. L’en-tête est indiqué par l’élément teiHeader et est constitué de quatre parties :

fileDesc (file description) contains a full bibliographic description of an electronic file.
encodingDesc (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived.
profileDesc (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting.
revisionDesc (revision description) summarizes the revision history for a file.

Un corpus, ou un recueil de textes, qui partagent beaucoup de caractéristiques, peuvent avoir un en-tête pour le corpus, et des en-têtes individuels pour chacun des composants. Dans ce cas, l’attribut type indique le type de l’en-tête. Et l’élément <teiHeader type="corpus"> présente les informations relatives au corpus.

Certains des éléments de l’en-tête contiennent du texte libre, encodé sous la forme d’un ou plusieurs p. D’autres sont regroupés :

Les éléments dont le nom se termine en Stmt (pour statement) rassemblent généralement un ensemble d’éléments enregistrant une certaine information structurée.
Les éléments dont le nom se termine en Decl (pour declaration) regroupent des informations sur les pratiques d’encodage particulières qui ont été utilisées.
Les éléments dont le nom se termine en Desc (pour description) contiennent une description sous la forme d’un texte libre.

19.1 La description du fichier

L’élément fileDesc est obligatoire. Il contient la description bibliographique complète du fichier, et utilise les éléments suivants :

titleStmt (title statement) groups information about the title of a work and those responsible for its content.
editionStmt (edition statement) groups information relating to one edition of a text.
extent describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units.
publicationStmt (publication statement) groups information concerning the publication or distribution of an electronic or other text.
seriesStmt (series statement) groups information about the series, if any, to which a publication belongs.
notesStmt (notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description.
sourceDesc (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence.

Un en-tête minimal a la structure suivante :

19.1.1 La mention du titre

Les éléments suivants peuvent être utilisés dans le titleStmt :

title contains a title for any kind of work.
author in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority.
sponsor specifies the name of a sponsoring organization or institution.
funder (funding body) specifies the name of an individual, institution, or organization responsible for the funding of a project or text.
principal (principal researcher) supplies the name of the principal researcher responsible for the creation of an electronic text.
respStmt (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work.

Le nom d’une ressource numérique dérivée de son nom non numérique sera évidemment très proche. Cependant, il est important de distinguer le nom du fichier informatique de celui du texte de la source. Par exemple :

<titleStmt>
<title>Artamène ou le Grand Cyrus : édition numérique</title>
<author>Scudéry, Madeleine (1608-1701)</author>
<respStmt>
  <resp>numérisation</resp>
  <name>Claude Bourqui</name>
  <name>Alexandre Gefen</name>
</respStmt>
</titleStmt>

19.1.2 La mention de l’édition

Le editionStmt rassemble les informations relatives à une seule édition de la ressource numérique (« édition » étant employé dans le sens qu’il a dans une bibliographie). Il peut inclure les éléments suivants :

edition (edition) describes the particularities of one edition of a text.
respStmt (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work.

Par exemple:

<editionStmt>
<edition n="U2">Troisième version, avec des révisions substantielles <date>1987</date>
</edition>
</editionStmt>

La détermination de ce qui implique l'existence d'une nouvelle édition reste de la responsabilité de l'encodeur.

19.1.3 La mention relative à la taille

La mention extent note la taille approximative de la ressource numérique.

Par exemple :

<extent>4532
octets</extent>

19.1.4 La mention de la publication

Le publicationStmt est obligatoire. Il contient soit une simple description sous la forme d’un texte libre, soit au moins l’un des trois éléments décrits ci-dessous :

publisher provides the name of the organization responsible for the publication or distribution of a bibliographic item.
distributor supplies the name of a person or other agency responsible for the distribution of a text.
authority (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor.

Chacun de ces trois éléments peuvent contenir les éléments suivants:

pubPlace (publication place) contains the name of the place where a bibliographic item was published.
address contains a postal address, for example of a publisher, an organization, or an individual.
idno (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way.
availability supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc.
licence contains information about a licence or other legal agreement applicable to the text.
date contains a date in any format.

Par exemple:

<publicationStmt>
<authority>Université François-Rabelais</authority>
<address>
  <addrLine>3 rue des Tanneurs</addrLine>
  <addrLine>37041 TOURS Cedex 1 France</addrLine>
</address>
<idno type="BVH">B360446201_B343_1</idno>
<availability status="restricted">
  <licence
    target="http://creativecommons.org/licenses/by-sa/2.0/"> Ce document
     est publié librement sur le web à destination de la communauté scientifique
     dans le cadre de la licence Creative Commons « Paternité-Pas d’Utilisation
     Commerciale-Partage des Conditions Initiales à l’Identique 2.0 France ».
  </licence>
</availability>
</publicationStmt>

19.1.5 La mention des séries et des notes

L’élément seriesStmt rassemble des informations concernant la collection ou la série, si elle existe, à laquelle la publication appartient. Il peut contenir les éléments title, idno ou des éléments respStmt.

19.1.6 La description de la source

L’élément sourceDesc est un élément obligatoire, qui enregistre toutes les spécifications de la source ou des sources, à partir desquelles le fichier informatique a été construit. Il peut contenir du texte libre, une référence bibliographique, et rassemble un ou plusieurs des éléments suivants :

bibl (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.
listBibl (citation list) contains a list of bibliographic citations of any kind.

Par exemple : <sourceDesc>
 <p>Texte original : le texte a été créé sous sa forme électronique.</p>
</sourceDesc>
<sourceDesc>
 <bibl>Mazelier, Roger : Gérard de Nerval et l’Humour divin, Le Mesnil Saint-Denis,
   1995.</bibl>
</sourceDesc>
<sourceDesc>
 <bibl>
  <title level="a">L'Enracinement</title>
  <author>Simone Weil</author>, <title>Prélude à une déclaration des devoirs
     envers l'être humain </title>. <publisher>Gallimard</publisher>
  <date>1968</date>. </bibl>
</sourceDesc>

19.2 La description de l’encodage

L’élément encodingDesc spécifie les méthodes et les principes éditoriaux qui ont présidé à la transcription du texte. Il est fortement recommandé de l’utiliser. Il peut contenir du texte libre ou bien des éléments de la liste suivante :

projectDesc (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected.
samplingDecl (sampling declaration) contains a prose description of the rationale and methods used in sampling texts in the creation of a corpus or collection.
editorialDecl (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text.
refsDecl (references declaration) specifies how canonical references are constructed for this text.
classDecl (classification declarations) contains one or more taxonomies defining any classificatory codes used elsewhere in the text.

19.2.1 Les descriptions du projet et des échantillons

Illustrons les éléments projectDesc et <samplingDesc> :

<encodingDesc>
<projectDesc>
  <p>Corpus de
     textes sélectionnés pour la formation Claremont Shakespeare Clinic, June 1990.
  </p>
</projectDesc>
</encodingDesc>

<encodingDesc>
<samplingDecl>
<p>Corpus
d'échantillons de 2000 mots pris au début de chaque texte. </p>
</samplingDecl>
</encodingDesc>

19.2.2 Les déclarations éditoriales

L’élément editorialDecl permet de décrire (texte libre) les pratiques mises en œuvre lors de l’encodage du texte. Typiquement, cette description doit couvrir les aspects suivants, chacun faisant l’objet d’un paragraphe séparé :

correction: comment et dans quelles circonstances, on a corrigé le texte.
normalization: les types de régularisations et standardisations, qui ont été opérés.
quotation: les décisions prises en matière de guillemets. Est-ce qu’ils ont été conservés ou remplacés par des références d’entité, les guillemets ouvrants et fermants sont-ils différenciés, etc. ?
hyphenation: ce que l’on a décidé en matière de césure (notamment les césures de fins de ligne). Est-ce qu’elles ont été conservées, remplacées par des références d’entité, etc. ?
segmentation: comment le texte a été segmenté, par exemple en phrases, en unités de ton, en strates graphémiques, etc. ?
interpretation: quelles informations analytiques, interprétatives ont été associées au texte ?

Exemple :

<editorialDecl>
<p>L'analyse
   morpho-syntaxique trouvée dans la section 4 a été ajoutée manuellement, et n'a
   pas encore été contrôllée.</p>
<p>Contrôle des erreurs d'orthographe effectué par
   le systeme WordPerfect. </p>
<p>Orthographe normalisée et modernisée selon
   dictionnaire Le Grand Robert.</p>
</editorialDecl>

19.2.3 Les déclarations relatives au système de référence et à la classification

On utilise l’élément refsDecl pour documenter la manière dont les schémas de référencement standard ont été construits. Dans sa forme la plus simple, l’élément contient du texte libre.

Exemple :

<refsDecl>
<p>L'attribut
<att>n</att> sur chaque <gi>div</gi> porte la référence canonique de cette
   division sous la forme XX.yyy : XX fournissant la numérotation du livre en
   chiffres romains, et yyy celle de la section en chiffres arabes, par ex
   II.234.</p>
</refsDecl>

L’élément classDecl regroupe les définitions ou les sources de n’importe quel schéma de classification descriptive, qui sont utilisées par les autres parties de l’en-tête. On doit fournir au moins un schéma, encodé en utilisant les éléments suivants :

taxonomy defines a typology either implicitly, by means of a bibliographic citation, or explicitly by a structured taxonomy.
bibl (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.
category contains an individual descriptive category, possibly nested within a superordinate category, within a user-defined taxonomy.
catDesc (category description) describes some category within a taxonomy or text typology, either in the form of a brief prose description or in terms of the situational parameters used by the TEI formal textDesc.

Dans le cas le plus simple, la taxinomie peut être définie par une référence bibliographique, comme dans l’exemple suivant :

<classDecl>
<taxonomy xml:id="RAMEAU">
  <bibl>Répertoire d'autorité-matière encyclopédique et alphabétique unifié
     (RAMEAU) de la Bibliothèque nationale de France. <ptr target="http://rameau.bnf.fr/"/>
  </bibl>
</taxonomy>
</classDecl>

L’encodeur peut procéder d’une autre manière, ou bien compléter ce qui précède, en définissant un schéma de classification spécifique et personel, comme dans l’exemple suivant :

<taxonomy xml:id="taxinomiePerso">
<category xml:id="a.a">
  <catDesc>littérature</catDesc>
</category>
<category xml:id="a.a.1">
  <catDesc>Drame bourgeois</catDesc>
</category>
<category xml:id="a.a.1.α">
  <catDesc>Comédie larmoyante</catDesc>
</category>
<category xml:id="a.b">
  <catDesc>Correspondance</catDesc>
</category>
<category xml:id="a.b.1.a">
  <catDesc>Dernières lettres</catDesc>
</category>
<category xml:id="a.c.">
  <catDesc>Littérature européenne -- 16e siècle</catDesc>
</category>
<category xml:id="a.c.1">
  <catDesc>Satire de la Renaissance </catDesc>
</category>
<category xml:id="a.d">
  <catDesc>Récits de voyage</catDesc>
</category>
<category xml:id="a.d.1">
  <catDesc>Récits de la mer </catDesc>
</category>
</taxonomy>>

Relier un texte particulier et une catégorie dans une taxinomie est réalisé grâce à l’élément catRef, inséré dans l’élément textClass. Cf. section ci-dessous.

19.3 La description du profil

L’élément profileDesc permet de caractériser la diversité des dimensions d’un texte. Il doit être enregistré dans un seul cadre et comporte trois constituants optionnels :

creation contains information about the creation of a text.
langUsage (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text.
textClass (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc.

L’élément creation permet de documenter le lieu de la création du travail, même si celui-ci n’a pas été publié, ou bien s’il n’a pas été explicitement noté dans la source.

Exemple :

<creation>
<date when="1992-08">August 1992</date>
<name type="place">Taos, New
Mexico</name>
</creation>

Quand le texte est écrit dans différentes langues, on se sert de l’élément langUsage. Celui-ci contient des éléments permettent de documenter chaque langue en particulier :

language characterizes a single language or sublanguage used within a text.

Par exemple, un texte comportant de manière prédominante du français tel qu’il est parlé au Québec, mais aussi de plus petits passages en anglais britannique et en anglais du Canada pourrait être documenté comme suit :

<langUsage>
<language ident="fr-CA" usage="60">Québecois</language>
<language ident="en-CA" usage="20">Anglais canadien</language>
<language ident="en-GB" usage="20">Anglais britannique</language>
</langUsage>

L’élément textClass permet de classer le texte. On établit une référence vers un système de classification, localement défini par l’élément classDecl, ou bien on réfère à un schéma établi de manière externe, par exemple la Classification Décimale Universelle. Les textes peuvent être aussi classés à l’aide de listes de mots clés, qui peuvent eux-mêmes venir de listes contrôlées, définies localement, ou bien de manière externe. On utilise les éléments suivants pour établir de telles classifications :

classCode (classification code) contains the classification code used for this text in some standard classification system.
catRef/ (category reference) specifies one or more defined categories within some taxonomy or text typology.
keywords contains a list of keywords or phrases identifying the topic or nature of a text.

La manière la plus simple de classer un texte est d’utiliser l’élément classCode. Par exemple, un texte avec la classification 410 de la Classification Décimale Universelle pourrait être documenté comme suit :

Quand un schéma de classification a été défini localement en utilisant l’élément taxonomy, présenté dans la section précédente, l’élément <carRef> peut être utilisé pour y faire référence. Poursuivons avec un exemple proposé plus tôt, un travail répertorié dans ce corpus comme Drame bourgeois et Récits de la mer. Il pourrait être documenté comme suit :

L’élément keywords contient la liste des mots clés ou des expressions qui identifient les thèmes ou la nature du texte. Comme précédemment, l’attribut scheme identifie la source de laquelle ces termes ont été extraits, par exemple du système RAMEAU, ou ailleurs. Les mots clés utilisés peuvent également être répertorié dans élément taxonomy, tel que cela été présenté ci-dessus :

<textClass>
<keywords scheme="#fr_RAMEAU">
  <term>Littérature française -- 20ème siècle -- Histoire et critique</term>
  <term>Littérature française -- Histoire et critique -- Théorie, etc.</term>
  <term>Français (langue) -- Style -- Bases de données.</term>
</keywords>
</textClass>

Les classifications multiples sont possibles ; elles utilisent les mécanismes décrits dans cette section.

19.4 La description des modifications

L’élément revisionDesc permet de tenir le journal des modifications. Le journal peut être conservé sous la forme d’une suite d’éléments change, chacun contenant une courte description de la modification. On utilise les attributs when et who pour spécifier quand le changement a eu lieu et qui en est responsable.

Exemple :

<revisionDesc>
<change when="1991-03-06" who="#EMB">fichier validé</change>
<change when="1990-05-25" who="#EMB">version corrigée</change>
</revisionDesc>

Dans un environnement de production, il est souvent préférable d’utiliser un système automatisé pour conserver la trace des modifications. Beaucoup de systèmes de gestion de versions peuvent être également configurés pour mettre à jour automatiquement l’en-tête TEI d’un fichier.

Appendix A Liste des éléments décrits

Le schéma TEI Lite est un sous-ensemble de la TEI P5. Dans la liste qui suit, qui rassemble des éléments et des classes qui ont été décrits ci-dessus, les informations, notamment les exemples, ont été extraites des définitions fournies dans la TEI P5, et peuvent donc renvoyer à des éléments ou des attributs qui ne sont pas proposées dans la TEI Lite. Notons cependant que seuls les éléments listés ici sont disponibles dans le schéma TEI Lite. Certaines spécifications peuvent également renvoyer à des attributs, qui, bien qu’ils soient disponibles dans la TEI Lite, n’ont pas été, par manque de place, présentés dans ce tutoriel.

Schema tei_lite: Elements

<abbr>

<abbr> (abbreviation) contains an abbreviation of any sort. [3.5.5. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.responsibility (@cert, @resp) att.typed (type, @subtype)

type

allows the encoder to classify the abbreviation according to some convenient typology.

Derived from	att.typed
Status	Optional
Datatype	`data.enumerated`
Note	The type attribute is provided for the sake of those who wish to classify abbreviations at their point of occurrence; this may be useful in some circumstances, though usually the same abbreviation will have the same type in all occurrences. As the sample values make clear, abbreviations may be classified by the method used to construct them, the method of writing them, or the referent of the term abbreviated; the typology used is up to the encoder and should be carefully planned to meet the needs of the expected use. For a typology of Middle English abbreviations, see [ID PETTY in TEI Guidelines]

Member of

model.choicePart model.pPart.editorial

Contained by

analysis: pc s w

figures: cell figDesc

header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

analysis: interp interpGrp pc s w

core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear

figures: figure formula

header: idno

linking: anchor seg

tagdocs: att code gi ident val

Declaration

element abbr
{
   att.global.attributes,
   att.responsibility.attributes,
   att.typed.attribute.subtype,
   attribute type { data.enumerated }?,
   macro.phraseSeq
}

Example

<choice>
<expan>North Atlantic Treaty Organization</expan>
<abbr cert="low">NorATO</abbr>
<abbr cert="high">NATO</abbr>
<abbr cert="high" xml:lang="fr">OTAN</abbr>
</choice>

Example

<choice>
<abbr>SPQR</abbr>
<expan>senatus populusque romanorum</expan>
</choice>

Note

The abbr tag is not required; if appropriate, the encoder may transcribe abbreviations in the source text silently, without tagging them. If abbreviations are not transcribed directly but expanded silently, then the TEI header should so indicate.

<add>

<add> (addition) contains letters, words, or phrases inserted in the source text by an author, scribe, annotator, or corrector. [3.4.3. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.transcriptional (@hand, @status, @cause, @seq) (att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp)) ) att.placement (@place) att.typed (@type, @subtype)
Member of	model.pPart.transcriptional
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element add { att.global.attributes, att.transcriptional.attributes, att.placement.attributes, att.typed.attributes, macro.paraContent }
Example	The story I am going to relate is true as to its main facts, and as to the consequences <add place="above">of these facts</add> from which this tale takes its title.
Note	In a diplomatic edition attempting to represent an original source, the add element should not be used for additions to the current TEI electronic edition made by editors or encoders. In these cases, either the corr or <supplied> element are recommended. In a TEI edition of a historical text with previous editorial emendations in which such additions or reconstructions are considered part of the source text, the use of add may be appropriate, dependent on the editorial philosophy of the project.

<address>

<address> contains a postal address, for example of a publisher, an organization, or an individual. [3.5.2. 2.2.4. 3.11.2.4. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.addressLike model.publicationStmtPart
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal publicationStmt sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp core: addrLine gap index lb milestone name note pb rs figures: figure header: idno linking: anchor
Declaration	element address { att.global.attributes, ( model.global, ( ( model.addrPart ), model.global )+ ) }
Example	<address> <street>via Marsala 24</street> <postCode>40126</postCode> <name>Bologna</name> <name n="I">Italy</name> </address>
Example	<address> <addrLine>Computing Center, MC 135</addrLine> <addrLine>P.O. Box 6998</addrLine> <addrLine>Chicago, IL 60680</addrLine> <addrLine>USA</addrLine> </address>
Note	This element should be used for postal addresses only. Within it, the generic element addrLine may be used as an alternative to any of the more specialized elements available from the model.addrPart class, such as <street>, <postCode> etc.

<addrLine>

<addrLine> (address line) contains one line of a postal address. [3.5.2. 2.2.4. 3.11.2.4. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.addrPart
Contained by	core: address
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element addrLine { att.global.attributes, macro.phraseSeq }
Example	<address> <addrLine>Computing Center, MC 135</addrLine> <addrLine>P.O. Box 6998</addrLine> <addrLine>Chicago, IL</addrLine> <addrLine>60680 USA</addrLine> </address>
Example	<addrLine> <ref target="tel:+1-201-555-0123">(201) 555 0123</ref> </addrLine>
Note	Addresses may be encoded either as a sequence of lines, or using any sequence of component elements from the model.addrPart class. Other non-postal forms of address, such as telephone numbers or email, should not be included within an address element directly but may be wrapped within an addrLine if they form part of the printed address in some source text.

<anchor>

<anchor> (anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element. [8.4.2. 16.4. ]
Module	linking
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)
Member of	model.milestoneLike
Contained by	analysis: s w core: abbr add addrLine address author bibl biblScope cit corr date del editor emph expan foreign gloss head hi item l label lg list listBibl mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled sp speaker stage term time title unclear figures: cell figure table header: authority change classCode distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: argument back body byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph front group imprimatur opener postscript salute signed text titlePage titlePart trailer
May contain	Empty element
Declaration	element anchor { att.global.attributes, att.typed.attributes, empty }
Example	<s>The anchor is he<anchor xml:id="A234"/>re somewhere.</s> <s>Help me find it.<ptr target="#A234"/> </s>
Note	On this element, the global xml:id attribute must be supplied to specify an identifier for the point at which this element occurs within a document. The value used may be chosen freely provided that it is unique within the document and is a syntactically valid name. There is no requirement for values containing numbers to be in sequence.

<argument>

<argument> contains a formal list or prose description of the topics addressed by a subdivision of a text. [4.2. 4.6. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divWrapper model.pLike.front model.titlepagePart
Contained by	core: lg list figures: figure table textstructure: back body div front group opener titlePage
May contain	analysis: interp interpGrp core: bibl cit desc gap head index l label lb lg list listBibl milestone note p pb q sp stage figures: figure table linking: anchor tagdocs: eg
Declaration	element argument { att.global.attributes, ( ( model.global \| model.headLike ), ( ( model.common ), model.global )+ ) }
Example	<argument> <p>Monte Video — Maldonado — Excursion to R Polanco — Lazo and Bolas — Partridges — Absence of Trees — Deer — Capybara, or River Hog — Tucutuco — Molothrus, cuckoo-like habits — Tyrant Flycatcher — Mocking-bird — Carrion Hawks — Tubes formed by Lightning — House struck</p> </argument>
Note	Often contains either a list or a paragraph

<att>

<att> (attribute) contains the name of an attribute appearing within running text. [22. ]

Module tagdocs

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

scheme

supplies an identifier for the scheme in which this name is defined.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	TEI (text encoding initiative) this attribute is part of the TEI scheme. [Default] DBK (docbook) this attribute is part of the Docbook scheme. XX (unknown) this attribute is part of an unknown scheme.

Member of

model.phrase.xml

Contained by

analysis: s

figures: cell figDesc

header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain Character data only

Declaration

element att
{
   att.global.attributes,
   attribute scheme { data.enumerated }?,
   data.name
}

Example

<p>The TEI defines six <soCalled>global</soCalled> attributes; their names are
<att>xml:id</att>, <att>rend</att>, <att>xml:lang</att>, <att>n</att>, <att>xml:space</att>,
and <att>xml:base</att>; <att scheme="XX">style</att> is not among them.</p>

Note

A namespace prefix may be used in order to specify the scheme as an alternative to specifying it via the scheme attribute: it takes precedence

<author>

<author> in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. [3.11.2.2. 2.2.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.naming (@role, @nymRef) (att.canonical (@key, @ref))
Member of	model.respLike
Contained by	core: bibl header: editionStmt titleStmt
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element author { att.global.attributes, att.naming.attributes, macro.phraseSeq }
Example	<author>British Broadcasting Corporation</author> <author>La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)</author> <author>Anonymous</author> <author>Bill and Melinda Gates Foundation</author> <author> <persName>Beaumont, Francis</persName> and <persName>John Fletcher</persName> </author> <author> <orgName key="BBC">British Broadcasting Corporation</orgName>: Radio 3 Network </author>
Note	Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use a generally recognized name authority file to supply the content for this element. The attributes key or ref may also be used to reference canonical information about the author(s) intended from any appropriate authority, such as a library catalogue or online resource. In the case of a broadcast, use this element for the name of the company or network responsible for making the broadcast. Where an author is unknown or unspecified, this element may contain text such as Unknown or Anonymous. When the appropriate TEI modules are in use, it may also contain detailed tagging of the names used for people, organizations or places, in particular where multiple names are given.

<authority>

<authority> (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor. [2.2.4. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.publicationStmtPart
Contained by	header: publicationStmt
May contain	analysis: interp interpGrp core: abbr address choice date emph expan foreign gap gloss hi index lb mentioned milestone name note num pb ptr ref rs soCalled term time title figures: figure header: idno linking: anchor tagdocs: att code gi ident val
Declaration	element authority { att.global.attributes, macro.phraseSeq.limited }
Example	<authority>John Smith</authority>

<availability>

<availability> supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. ]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)

status

supplies a code identifying the current availability of the text.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	free the text is freely available. unknown the status of the text is unknown. [Default] restricted the text is not freely available.

Member of

model.publicationStmtPart

Contained by

header: publicationStmt

May contain

core: p

header: licence

Declaration

element availability
{
   att.global.attributes,
   att.declarable.attributes,
   attribute status { "free" | "unknown" | "restricted" }?,
   ( model.availabilityPart | model.pLike )+
}

Example

<availability status="restricted">
<p>Available for academic research purposes only.</p>
</availability>
<availability status="free">
<p>In the public domain</p>
</availability>
<availability status="restricted">
<p>Available under licence from the publishers.</p>
</availability>

Example

<availability>
<licence target="http://opensource.org/licenses/MIT">
  <p>The MIT License
     applies to this document.</p>
  <p>Copyright (C) 2011 by The University of Victoria</p>
  <p>Permission is hereby granted, free of charge, to any person obtaining a copy
     of this software and associated documentation files (the "Software"), to deal
     in the Software without restriction, including without limitation the rights
     to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
     copies of the Software, and to permit persons to whom the Software is
     furnished to do so, subject to the following conditions:</p>
  <p>The above copyright notice and this permission notice shall be included in
     all copies or substantial portions of the Software.</p>
  <p>THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
     IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
     FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
     AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
     LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
     OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
     THE SOFTWARE.</p>
</licence>
</availability>

Note

A consistent format should be adopted

<back>

<back> (back matter) contains any appendixes, etc. following the main part of a text. [4.7. 4. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls)
Member of
Contained by	textstructure: text
May contain	analysis: interp interpGrp core: divGen gap head index lb list milestone note p pb figures: figure table linking: anchor textstructure: argument byline closer div docAuthor docDate docEdition docImprint docTitle epigraph postscript signed titlePage titlePart trailer
Declaration	element back { att.global.attributes, att.declaring.attributes, ( ( model.frontPart \| model.pLike.front \| model.pLike \| model.listLike \| model.global ), ( ( ( ( model.div1Like ), ( model.frontPart \| model.div1Like \| model.global ) ) \| ( ( model.divLike ), ( model.frontPart \| model.divLike \| model.global )* ) )? ), ( ( ( model.divBottomPart ), ( model.divBottomPart \| model.global )* )? ) ) }
Example	<back> <div1 type="appendix"> <head>The Golden Dream or, the Ingenuous Confession</head> <p>To shew the Depravity of human Nature </p> </div1> <div1 type="epistle"> <head>A letter from the Printer, which he desires may be inserted</head> <salute>Sir.</salute> <p>I have done with your Copy, so you may return it to the Vatican, if you please </p> </div1> <div1 type="advert"> <head>The Books usually read by the Scholars of Mrs Two-Shoes are these and are sold at Mr Newbery's at the Bible and Sun in St Paul's Church-yard.</head> <list> <item n="1">The Christmas Box, Price 1d.</item> <item n="2">The History of Giles Gingerbread, 1d.</item> <item n="42">A Curious Collection of Travels, selected from the Writers of all Nations, 10 Vol, Pr. bound 1l.</item> </list> </div1> <div1 type="advert"> <head> <hi rend="center">By the KING's Royal Patent,</hi> Are sold by J. NEWBERY, at the Bible and Sun in St. Paul's Church-Yard.</head> <list> <item n="1">Dr. James's Powders for Fevers, the Small-Pox, Measles, Colds, &c. 2s. 6d</item> <item n="2">Dr. Hooper's Female Pills, 1s.</item> </list> </div1> </back>
Note	The content model of back matter is identical to that of front matter, reflecting the facts of cultural history.

<bibl>

<bibl> (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged. [3.11.1. 2.2.7. 15.3.2. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default) att.typed (@type, @subtype) att.sortable (@sortKey)
Member of	model.biblLike model.biblPart
Contained by	core: add bibl cit corr del desc emph head hi item l listBibl note orig p q ref reg relatedItem sic stage title unclear figures: cell figDesc figure header: change licence sourceDesc taxonomy linking: seg textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart
May contain	analysis: interp interpGrp pc s w core: abbr add address author bibl biblScope choice corr date del editor emph expan foreign gap gloss hi index lb mentioned milestone name note num orig pb ptr pubPlace publisher ref reg relatedItem respStmt rs sic soCalled term time title unclear figures: figure header: distributor edition extent funder idno principal sponsor linking: anchor seg tagdocs: code ident
Declaration	element bibl { att.global.attributes, att.declarable.attributes, att.typed.attributes, att.sortable.attributes, ( text \| model.gLike \| model.highlighted \| model.pPart.data \| model.pPart.edit \| model.segLike \| model.ptrLike \| model.biblPart \| model.global )* }
Example	<bibl>Blain, Clements and Grundy: Feminist Companion to Literature in English (Yale, 1990)</bibl>
Example	<bibl> <title level="a">The Interesting story of the Children in the Wood</title>. In <author>Victor E Neuberg</author>, <title>The Penny Histories</title>. <publisher>OUP</publisher> <date>1968</date>. </bibl>
Example	<bibl type="article" subtype="book_chapter" xml:id="carlin_2003"> <author> <name> <surname>Carlin</surname> (<forename>Claire</forename>)</name> </author>, <title level="a">The Staging of Impotence : France’s last congrès</title> dans <bibl type="monogr"> <title level="m">Theatrum mundi : studies in honor of Ronald W. Tobin</title>, éd. <editor> <name> <forename>Claire</forename> <surname>Carlin</surname> </name> </editor> et <editor> <name> <forename>Kathleen</forename> <surname>Wine</surname> </name> </editor>, <pubPlace>Charlottesville, Va.</pubPlace>, <publisher>Rookwood Press</publisher>, <date when="2003">2003</date>. </bibl> </bibl>
Note	Contains phrase-level elements, together with any combination of elements from the biblPart class

<biblScope>

<biblScope> (scope of bibliographic reference) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work. [3.11.2.4. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

unit

identifies the unit of information conveyed by the element, e.g. columns, pages, volume.

Status	Optional
Datatype	`data.enumerated`
Suggested values include:	volume the element contains a volume number. issue the element contains an issue number, or volume and issue numbers. page the element contains a page number or page range. line the element contains a line number or line range. chapter the element contains a chapter indication (number and/or title) part the element identifies a part of a book or collection. column the element identifies a column.

type

identifies the type of information conveyed by the element, e.g. columns, pages, volume.

Deprecated	will be removed on 2014-12-17
Status	Optional
Datatype	`data.enumerated`
Suggested values include:	vol (volume) the element contains a volume number. issue the element contains an issue number, or volume and issue numbers. pp (pages) the element contains a page number or page range. ll (lines) the element contains a line number or line range. chap (chapter) the element contains a chapter indication (number and/or title) part the element identifies a part of a book or collection.

from

specifies the starting point of the range of units indicated by the unit attribute.

Status	Optional
Datatype	`data.word`

specifies the end-point of the range of units indicated by the unit attribute.

Status	Optional
Datatype	`data.word`

Member of

model.imprintPart

Contained by

core: bibl

header: seriesStmt

May contain

analysis: interp interpGrp pc s w

core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear

figures: figure formula

header: idno

linking: anchor seg

tagdocs: att code gi ident val

Declaration

element biblScope
{
   att.global.attributes,
   attribute unit
   {
      "volume"
    | "issue"
    | "page"
    | "line"
    | "chapter"
    | "part"
    | "column"
    | xsd:Name
   }?,
   attribute type
   {
      "vol" | "issue" | "pp" | "ll" | "chap" | "part" | xsd:Name
   }?,
   attribute from { data.word }?,
   attribute to { data.word }?,
   macro.phraseSeq
}

Example

<body>

<body> (text body) contains the whole body of a single unitary text, excluding any front or back matter. [4. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls)
Member of
Contained by	textstructure: text
May contain	analysis: interp interpGrp core: bibl cit desc divGen gap head index l label lb lg list listBibl milestone note p pb q sp stage figures: figure table linking: anchor tagdocs: eg textstructure: argument byline closer dateline div docAuthor docDate epigraph opener postscript salute signed trailer
Declaration	element body { att.global.attributes, att.declaring.attributes, ( model.global, ( ( model.divTop ), ( model.global \| model.divTop ) )?, ( ( model.divGenLike ), ( model.global \| model.divGenLike )* )?, ( ( ( model.divLike ), ( model.global \| model.divGenLike )* )+ \| ( ( model.div1Like ), ( model.global \| model.divGenLike )* )+ \| ( ( ( model.common ), model.global* )+, ( ( ( model.divLike ), ( model.global \| model.divGenLike )* )+ \| ( ( model.div1Like ), ( model.global \| model.divGenLike )* )+ )? ) ), ( ( model.divBottom ), model.global* )* ) }

<byline>

<byline> contains the primary statement of responsibility given for a work on its title page or at the head or end of the work. [4.2.2. 4.5. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divWrapper model.pLike.front model.titlepagePart
Contained by	core: lg list figures: figure table textstructure: back body div front group opener titlePage
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val textstructure: docAuthor
Declaration	element byline { att.global.attributes, ( text \| model.gLike \| model.phrase \| docAuthor \| model.global )* }
Example	<byline>Written by a CITIZEN who continued all the while in London. Never made publick before.</byline>
Example	<byline>Written from her own MEMORANDUMS</byline>
Example	<byline>By George Jones, Political Editor, in Washington</byline>
Example	<byline>BY <docAuthor>THOMAS PHILIPOTT,</docAuthor> Master of Arts, (Somtimes) Of Clare-Hall in Cambridge.</byline>
Note	The byline on a title page may include either the name or a description for the document's author. Where the name is included, it may optionally be tagged using the docAuthor element.

<catDesc>

<catDesc> (category description) describes some category within a taxonomy or text typology, either in the form of a brief prose description or in terms of the situational parameters used by the TEI formal textDesc. [2.3.7. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: category
May contain	core: abbr address choice date emph expan foreign gloss hi mentioned name num ptr ref rs soCalled term time title header: idno tagdocs: att code gi ident val
Declaration	element catDesc { att.global.attributes, ( text \| model.limitedPhrase \| model.catDescPart )* }
Example	<catDesc>Prose reportage</catDesc>
Example	<catDesc> <textDesc n="novel"> <channel mode="w">print; part issues</channel> <constitution type="single"/> <derivation type="original"/> <domain type="art"/> <factuality type="fiction"/> <interaction type="none"/> <preparedness type="prepared"/> <purpose type="entertain" degree="high"/> <purpose type="inform" degree="medium"/> </textDesc> </catDesc>

<category>

<category> contains an individual descriptive category, possibly nested within a superordinate category, within a user-defined taxonomy. [2.3.7. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: category taxonomy
May contain	core: desc gloss header: catDesc category
Declaration	element category { att.global.attributes, ( ( catDesc+ \| ( model.descLike \| model.glossLike )* ), category* ) }
Example	<category xml:id="b1"> <catDesc>Prose reportage</catDesc> </category>
Example	<category xml:id="b2"> <catDesc>Prose </catDesc> <category xml:id="b11"> <catDesc>journalism</catDesc> </category> <category xml:id="b12"> <catDesc>fiction</catDesc> </category> </category>
Example	<category xml:id="LIT"> <catDesc xml:lang="pl">literatura piękna</catDesc> <catDesc xml:lang="en">fiction</catDesc> <category xml:id="LPROSE"> <catDesc xml:lang="pl">proza</catDesc> <catDesc xml:lang="en">prose</catDesc> </category> <category xml:id="LPOETRY"> <catDesc xml:lang="pl">poezja</catDesc> <catDesc xml:lang="en">poetry</catDesc> </category> <category xml:id="LDRAMA"> <catDesc xml:lang="pl">dramat</catDesc> <catDesc xml:lang="en">drama</catDesc> </category> </category>

<catRef>

<catRef> (category reference) specifies one or more defined categories within some taxonomy or text typology. [2.4.3. ]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.pointing (@targetLang, @target, @evaluate)

scheme

identifies the classification scheme within which the set of categories concerned is defined

Status	Optional
Datatype	`data.pointer`
Values	May supply the identifier of the associated taxonomy element.

Member of

Contained by

header: textClass

May contain Empty element

Declaration

element catRef
{
   att.global.attributes,
   att.pointing.attributes,
   attribute scheme { data.pointer }?,
   empty
}

Example

<catRef target="#news #prov #sales2"/>

<taxonomy>
<category xml:id="news">
  <catDesc>Newspapers</catDesc>
</category>
<category xml:id="prov">
  <catDesc>Provincial</catDesc>
</category>
<category xml:id="sales2">
  <catDesc>Low to average annual sales</catDesc>
</category>
</taxonomy>

Note

The scheme attribute need be supplied only if more than one taxonomy has been declared

<cell>

<cell> contains one cell of a table. [14.1.1. ]
Module	figures
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.tableDecoration (@role, @rows, @cols)
Member of
Contained by	figures: row
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index l label lb lg list listBibl mentioned milestone name note num orig p pb ptr q ref reg rs sic soCalled sp stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element cell { att.global.attributes, att.tableDecoration.attributes, macro.specialPara }
Example	<row> <cell role="label">General conduct</cell> <cell role="data">Not satisfactory, on account of his great unpunctuality and inattention to duties</cell> </row>

<change>

<change> documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.5. 2.4.1. ]

Module header

Attributes

Attributes att.ascribed (@who) att.datable (@calendar, @period) (att.datable.w3c (@when)) att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)

target

points to one or more elements that belong to this change.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace

Member of

Contained by

header: revisionDesc

May contain

analysis: interp interpGrp pc s w

core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index l label lb lg list listBibl mentioned milestone name note num orig p pb ptr q ref reg rs sic soCalled sp stage term time title unclear

figures: figure formula table

header: idno

linking: anchor seg

tagdocs: att code eg gi ident val

Declaration

element change
{
   att.ascribed.attributes,
   att.datable.attributes,
   att.global.attributes,
   att.typed.attributes,
   attribute target { list { data.pointer+ } }?,
   macro.specialPara
}

Example

<titleStmt>
<title> ... </title>
<editor xml:id="LDB">Lou Burnard</editor>
<respStmt xml:id="BZ">
<resp>copy editing</resp>
<name>Brett Zamir</name>
</respStmt>
</titleStmt>

<revisionDesc status="published">
<change who="#BZ" when="2008-02-02" status="public">Finished chapter 23</change>
<change who="#BZ" when="2008-01-02" status="draft">Finished chapter 2</change>
<change n="P2.2" when="1991-12-21" who="#LDB">Added examples to section 3</change>
<change when="1991-11-11" who="#MSM">Deleted chapter 10</change>
</revisionDesc>

Example

<profileDesc>
<creation>
  <listChange>
   <change xml:id="DRAFT1">First draft in pencil</change>
   <change xml:id="DRAFT2" notBefore="1880-12-09">First revision, mostly
       using green ink</change>
   <change xml:id="DRAFT3" notBefore="1881-02-13">Final corrections as
       supplied to printer.</change>
  </listChange>
</creation>
</profileDesc>

Note

The who attribute may be used to point to any other element, but will typically specify a respStmt or <person> element elsewhere in the header, identifying the person responsible for the change and their role in making it.

It is recommended that changes be recorded with the most recent first. The status attribute may be used to indicate the status of a document following the change documented.

<choice>

<choice> groups a number of alternative encodings for the same point in a text. [3.4. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.linePart model.pPart.editorial
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope choice corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	core: abbr choice corr expan orig reg sic unclear linking: seg
Declaration	element choice { att.global.attributes, ( model.choicePart \| choice )* }
Example	An American encoding of Gulliver's Travels which retains the British spelling but also provides a version regularized to American spelling might be encoded as follows. <p>Lastly, That, upon his solemn oath to observe all the above articles, the said man-mountain shall have a daily allowance of meat and drink sufficient for the support of <choice> <sic>1724</sic> <corr>1728</corr> </choice> of our subjects, with free access to our royal person, and other marks of our <choice> <orig>favour</orig> <reg>favor</reg> </choice>.</p>
Note	Because the children of a choice element all represent alternative ways of encoding the same sequence, it is natural to think of them as mutually exclusive. However, there may be cases where a full representation of a text requires the alternative encodings to be considered as parallel. Note also that choice elements may self-nest. Where the purpose of an encoding is to record multiple witnesses of a single work, rather than to identify multiple possible encoding decisions at a given point, the <app> element and associated elements discussed in section 12.1. should be preferred.

<cit>

<cit> (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example. [3.3.3. 4.3.1. 9.3.5.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)
Member of	model.entryPart.top model.quoteLike
Contained by	core: add cit corr del desc emph head hi item l note orig p q ref reg sic sp stage title unclear figures: cell figDesc figure header: change licence linking: seg textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart
May contain	analysis: interp interpGrp core: bibl cit gap index lb listBibl milestone note pb ptr q ref figures: figure linking: anchor tagdocs: eg
Declaration	element cit { att.global.attributes, att.typed.attributes, ( model.qLike \| model.egLike \| model.biblLike \| model.ptrLike \| model.global \| model.entryPart )+ }
Example	<cit> <quote>and the breath of the whale is frequently attended with such an insupportable smell, as to bring on disorder of the brain.</quote> <bibl>Ulloa's South America</bibl> </cit>
Example	<entry> <form> <orth>horrifier</orth> </form> <cit type="translation" xml:lang="en"> <quote>to horrify</quote> </cit> <cit type="example"> <quote>elle était horrifiée par la dépense</quote> <cit type="translation" xml:lang="en"> <quote>she was horrified at the expense.</quote> </cit> </cit> </entry>

<classCode>

<classCode> (classification code) contains the classification code used for this text in some standard classification system. [2.4.3. ]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

scheme

identifies the classification system or taxonomy in use.

Status	Required
Datatype	`data.pointer`
Values	may point to a local definition, for example in a taxonomy element, or more usually to some external location where the scheme is fully defined.

Member of

Contained by

header: textClass

May contain

analysis: interp interpGrp

core: abbr address choice date emph expan foreign gap gloss hi index lb mentioned milestone name note num pb ptr ref rs soCalled term time title

figures: figure

header: idno

linking: anchor

tagdocs: att code gi ident val

Declaration

element classCode
{
   att.global.attributes,
   attribute scheme { data.pointer },
   macro.phraseSeq.limited
}

Example

<classDecl>

<classDecl> (classification declarations) contains one or more taxonomies defining any classificatory codes used elsewhere in the text. [2.3.7. 2.3. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	header: taxonomy
Declaration	element classDecl { att.global.attributes, taxonomy+ }
Example	<classDecl> <taxonomy xml:id="LCSH"> <bibl>Library of Congress Subject Headings</bibl> </taxonomy> </classDecl> <!-- ... --> <textClass> <keywords scheme="#LCSH"> <term>Political science</term> <term>United States -- Politics and government — Revolution, 1775-1783</term> </keywords> </textClass>

<closer>

<closer> groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter. [4.2.2. 4.2. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divBottomPart
Contained by	core: lg list figures: figure table textstructure: back body div front group postscript
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val textstructure: dateline salute signed
Declaration	element closer { att.global.attributes, ( text \| model.gLike \| signed \| dateline \| salute \| model.phrase \| model.global )* }
Example	<div type="letter"> <p> perhaps you will favour me with a sight of it when convenient.</p> <closer> <salute>I remain, &c. &c.</salute> <signed>H. Colburn</signed> </closer> </div>
Example	<div type="chapter"> <p> <!-- .... --> and his heart was going like mad and yes I said yes I will Yes.</p> <closer> <dateline> <name type="place">Trieste-Zürich-Paris,</name> <date>1914–1921</date> </dateline> </closer> </div>

<code>

<code> contains literal code from some formal language such as a programming language. [22.1.1. ]

Module tagdocs

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

lang

(formal language) a name identifying the formal language in which the code is expressed

Status	Optional
Datatype	`data.word`

Member of

model.emphLike

Contained by

analysis: s

figures: cell figDesc

header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain Character data only

Declaration

element code { att.global.attributes, attribute lang { data.word }?, text }

Example

<code lang="JAVA"> Size fCheckbox1Size = new Size();
fCheckbox1Size.Height = 500;
fCheckbox1Size.Width = 500;
xCheckbox1.setSize(fCheckbox1Size);
</code>

<corr>

<corr> (correction) contains the correct form of a passage apparently erroneous in the copy text. [3.4.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp)) att.typed (@type, @subtype)
Member of	model.choicePart model.pPart.transcriptional
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope choice corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element corr { att.global.attributes, att.editLike.attributes, att.typed.attributes, macro.paraContent }
Example	If all that is desired is to call attention to the fact that the copy text has been corrected, corr may be used alone: I don't know, Juan. It's so far in the past now — how <corr>can we</corr> prove or disprove anyone's theories?
Example	It is also possible, using the choice and sic elements, to provide an uncorrected reading: I don't know, Juan. It's so far in the past now — how <choice> <sic>we can</sic> <corr>can we</corr> </choice> prove or disprove anyone's theories?

<creation>

<creation> contains information about the creation of a text. [2.4.1. 2.4. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.datable (@calendar, @period) (att.datable.w3c (@when))
Member of	model.profileDescPart
Contained by	header: profileDesc
May contain	core: abbr address choice date emph expan foreign gloss hi mentioned name num ptr ref rs soCalled term time title header: idno tagdocs: att code gi ident val
Declaration	element creation { att.global.attributes, att.datable.attributes, ( text \| model.limitedPhrase \| listChange )* }
Example	<creation> <date>Before 1987</date> </creation>
Example	<creation> <date when="1988-07-10">10 July 1988</date> </creation>
Note	The creation element may be used to record details of a text's creation, e.g. the date and place it was composed, if these are of interest. It may also contain a more structured account of the various stages or revisions associated with the evolution of a text; this should be encoded using the <listChange> element. It should not be confused with the publicationStmt element, which records date and place of publication.

<date>

<date> contains a date in any format. [3.5.4. 2.2.4. 2.5. 3.11.2.4. 15.2.3. 13.3.6. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.datable (@calendar, @period) (att.datable.w3c (@when)) att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp)) att.typed (@type, @subtype)
Member of	model.dateLike model.publicationStmtPart
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal publicationStmt sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element date { att.global.attributes, att.datable.attributes, att.editLike.attributes, att.typed.attributes, ( text \| model.gLike \| model.phrase \| model.global )* }
Example	<date when="1980-02">early February 1980</date>
Example	Given on the <date when="1977-06-12">Twelfth Day of June in the Year of Our Lord One Thousand Nine Hundred and Seventy-seven of the Republic the Two Hundredth and first and of the University the Eighty-Sixth.</date>
Example	<date when="1990-09">September 1990</date>

<dateline>

<dateline> contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer. [4.2.2. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divWrapper
Contained by	core: lg list figures: figure table textstructure: body closer div front group opener
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element dateline { att.global.attributes, macro.phraseSeq }
Example	<dateline>Walden, this 29. of August 1592</dateline>
Example	<div type="chapter"> <p> <!-- ... --> and his heart was going like mad and yes I said yes I will Yes.</p> <closer> <dateline> <name type="place">Trieste-Zürich-Paris,</name> <date>1914–1921</date> </dateline> </closer> </div>

<del>

<del> (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, annotator, or corrector. [3.4.3. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.transcriptional (@hand, @status, @cause, @seq) (att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp)) ) att.typed (@type, @subtype)
Member of	model.pPart.transcriptional
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element del { att.global.attributes, att.transcriptional.attributes, att.typed.attributes, macro.paraContent }
Example	<l> <del rend="overtyped">Mein</del> Frisch <del rend="overstrike" type="primary">schwebt</del> weht der Wind </l>
Example	<del rend="overstrike"> <gap reason="illegible" quantity="5" unit="character"/> </del>
Note	Degrees of uncertainty over what can still be read, or whether a deletion was intended may be indicated by use of the <certainty> element (see 21. ). This element should be used for deletion of shorter sequences of text, typically single words or phrases. The <delSpan> element should be used for longer sequences of text, for those containing structural subdivisions, and for those containing overlapping additions and deletions. The text deleted must be at least partially legible in order for the encoder to be able to transcribe it (unless it is restored in a <supplied> tag). Illegible or lost text within a deletion may be marked using the gap tag to signal that text is present but has not been transcribed, or is no longer visible. Attributes on the gap element may be used to indicate how much text is omitted, the reason for omitting it, etc. If text is not fully legible, the unclear element (available when using the additional tagset for transcription of primary sources) should be used to signal the areas of text which cannot be read with confidence in a similar way. There is a clear distinction in the TEI between del, which is a statement of the the or a later scribe's intent to cancel or remove text, and elements such as gap or unclear which signal the editor's decision to omit or inability to read the text remaining. See further sections 11.3.1.7. and, for the close association of the del tag with the gap, <damage>, unclear and <supplied> elements (the latter three tags available when using the additional tagset for transcription of primary sources), 11.3.3.2. . The del tag should not be used for deletions made by editors or encoders. In these cases, either corr, <surplus> or gap should be used.

<desc>

<desc> (description) contains a brief description of the object documented by its parent element, including its intended usage, purpose, or application where this is appropriate. [22.4.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.translatable (@versionDate)
Member of	model.descLike model.labelLike
Contained by	analysis: interp interpGrp core: add corr del desc emph gap graphic head hi item l lg note orig p q ref reg sic stage title unclear figures: cell figDesc figure header: category change licence taxonomy linking: seg textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart
May contain	core: abbr address bibl choice cit date desc emph expan foreign gloss hi label list listBibl mentioned name num ptr q ref rs soCalled stage term time title figures: table header: idno tagdocs: att code eg gi ident val
Declaration	element desc { att.global.attributes, att.translatable.attributes, macro.limitedContent }
Example	<desc>contains a brief description of the purpose and application for an element, attribute, attribute value, class, or entity.</desc>
Note	TEI convention requires that this be expressed as a finite clause, begining with an active verb.

<distributor>

<distributor> supplies the name of a person or other agency responsible for the distribution of a text. [2.2.4. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.imprintPart model.publicationStmtPart
Contained by	core: bibl header: publicationStmt
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element distributor { att.global.attributes, macro.phraseSeq }
Example	<distributor>Oxford Text Archive</distributor> <distributor>Redwood and Burn Ltd</distributor>

<div>

<div> (text division) contains a subdivision of the front, body, or back of a text. [4.1. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.divLike (@org, @sample) (att.fragmentable (@part)) att.typed (@type, @subtype) att.declaring (@decls)
Member of	model.divLike
Contained by	textstructure: back body div front
May contain	analysis: interp interpGrp core: bibl cit desc divGen gap head index l label lb lg list listBibl milestone note p pb q sp stage figures: figure table linking: anchor tagdocs: eg textstructure: argument byline closer dateline div docAuthor docDate epigraph opener postscript salute signed trailer
Declaration	element div { att.global.attributes, att.divLike.attributes, att.typed.attributes, att.declaring.attributes, ( ( model.divTop \| model.global ), ( ( ( ( ( model.divLike \| model.divGenLike ), model.global )+ ) \| ( ( ( model.common ), model.global* )+, ( ( model.divLike \| model.divGenLike ), model.global* )* ) ), ( ( model.divBottom ), model.global* )* )? ) }
Example	<body> <div type="part"> <head>Fallacies of Authority</head> <p>The subject of which is Authority in various shapes, and the object, to repress all exercise of the reasoning faculty.</p> <div n="1" type="chapter"> <head>The Nature of Authority</head> <p>With reference to any proposed measures having for their object the greatest happiness of the greatest number....</p> <div n="1.1" type="section"> <head>Analysis of Authority</head> <p>What on any given occasion is the legitimate weight or influence to be attached to authority ... </p> </div> <div n="1.2" type="section"> <head>Appeal to Authority, in What Cases Fallacious.</head> <p>Reference to authority is open to the charge of fallacy when... </p> </div> </div> </div> </body>

<divGen>

<divGen> (automatically generated text division) indicates the location at which a textual division generated automatically by a text-processing application is to appear. [3.8.2. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

type

specifies what type of generated text division (e.g. index, table of contents, etc.) is to appear.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	index an index is to be generated and inserted at this point. toc a table of contents figlist a list of figures tablist a list of tables
Note	Valid values are application-dependent; those shown are of obvious utility in document production, but are by no means exhaustive.

Member of

model.divGenLike model.frontPart

Contained by

textstructure: back body div front

May contain

core: head

Declaration

element divGen
{
   att.global.attributes,
   attribute type { data.enumerated }?,
   model.headLike*
}

Example

One use for this element is to allow document preparation software to generate an index and insert it in the appropriate place in the output. The example below assumes that the indexName attribute on index elements in the text has been used to specify index entries for the two generated indexes, named NAMES and THINGS:

<back>
<div1 type="backmat">
  <head>Bibliography</head>

</div1>
<div1 type="backmat">
  <head>Indices</head>
  <divGen n="Index Nominum" type="NAMES"/>
  <divGen n="Index Rerum" type="THINGS"/>
</div1>
</back>

Example

Another use for divGen is to specify the location of an automatically produced table of contents:

<front>

<divGen type="toc"/>
<div>
<head>Preface</head>
<p> ... </p>
</div>
</front>

Note

This element is intended primarily for use in document production or manipulation, rather than in the transcription of pre-existing materials; it makes it easier to specify the location of indices, tables of contents, etc., to be generated by text preparation or word processing software.

<docAuthor>

<docAuthor> (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline). [4.6. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.canonical (@key, @ref)
Member of	model.divWrapper model.pLike.front model.titlepagePart
Contained by	core: lg list figures: figure table textstructure: back body byline div front group titlePage
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element docAuthor { att.global.attributes, att.canonical.attributes, macro.phraseSeq }
Example	<titlePage> <docTitle> <titlePart>Travels into Several Remote Nations of the World, in Four Parts.</titlePart> </docTitle> <byline> By <docAuthor>Lemuel Gulliver</docAuthor>, First a Surgeon, and then a Captain of several Ships</byline> </titlePage>
Note	The document author's name often occurs within a byline, but the docAuthor element may be used whether the byline element is used or not.

<docDate>

<docDate> (document date) contains the date of a document, as given (usually) on a title page. [4.6. ]

Module textstructure

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

when

gives the value of the date in standard form, i.e. YYYY-MM-DD.

Status	Optional
Datatype	`data.temporal.w3c`
Values	a date in one of the formats specified in XML Schema Part 2: Datatypes Second Edition
Note	For simple dates, the when attribute should give the Gregorian or proleptic Gregorian date in the form (YYYY-MM-DD) specified by XML Schema Part 2.

Member of

model.divWrapper model.pLike.front model.titlepagePart

Contained by

core: lg list

figures: figure table

textstructure: back body div docImprint front group titlePage

May contain

analysis: interp interpGrp pc s w

core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear

figures: figure formula

header: idno

linking: anchor seg

tagdocs: att code gi ident val

Declaration

element docDate
{
   att.global.attributes,
   attribute when { data.temporal.w3c }?,
   macro.phraseSeq
}

Example

<docImprint>Oxford, Clarendon Press, <docDate>1987</docDate>
</docImprint>

Note

Cf. the general date element in the core tag set. This specialized element is provided for convenience in marking and processing the date of the documents, since it is likely to require specialized handling for many applications.

<docEdition>

<docEdition> (document edition) contains an edition statement as presented on a title page of a document. [4.6. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.pLike.front model.titlepagePart
Contained by	textstructure: back front titlePage
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element docEdition { att.global.attributes, macro.paraContent }
Example	<docEdition>The Third edition Corrected</docEdition>
Note	Cf. the edition element of bibliographic citation. As usual, the shorter name has been given to the more frequent element.

<docImprint>

<docImprint> (document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page. [4.6. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.pLike.front model.titlepagePart
Contained by	textstructure: back front titlePage
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr pubPlace publisher ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val textstructure: docDate
Declaration	element docImprint { att.global.attributes, ( text \| model.gLike \| model.phrase \| pubPlace \| docDate \| publisher \| model.global )* }
Example	<docImprint>Oxford, Clarendon Press, 1987</docImprint> Imprints may be somewhat more complex: <docImprint> <pubPlace>London</pubPlace> Printed for <name>E. Nutt</name>, at <pubPlace>Royal Exchange</pubPlace>; <name>J. Roberts</name> in <pubPlace>wick-Lane</pubPlace>; <name>A. Dodd</name> without <pubPlace>Temple-Bar</pubPlace>; and <name>J. Graves</name> in <pubPlace>St. James's-street.</pubPlace> <date>1722.</date> </docImprint>
Note	Cf. the <imprint> element of bibliographic citations. As with title, author, and editions, the shorter name is reserved for the element likely to be used more often.

<docTitle>

<docTitle> (document title) contains the title of a document, including all its constituents, as given on a title page. [4.6. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.canonical (@key, @ref)
Member of	model.pLike.front model.titlepagePart
Contained by	textstructure: back front titlePage
May contain	analysis: interp interpGrp core: gap index lb milestone note pb figures: figure linking: anchor textstructure: titlePart
Declaration	element docTitle { att.global.attributes, att.canonical.attributes, ( model.global, ( titlePart, model.global )+ ) }
Example	<docTitle> <titlePart type="main">The DUNCIAD, VARIOURVM.</titlePart> <titlePart type="sub">WITH THE PROLEGOMENA of SCRIBLERUS.</titlePart> </docTitle>

<edition>

<edition> (edition) describes the particularities of one edition of a text. [2.2.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.biblPart
Contained by	core: bibl header: editionStmt
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element edition { att.global.attributes, macro.phraseSeq }
Example	<edition>First edition <date>Oct 1990</date> </edition> <edition n="S2">Students' edition</edition>

<editionStmt>

<editionStmt> (edition statement) groups information relating to one edition of a text. [2.2.2. 2.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: fileDesc
May contain	core: author editor p respStmt header: edition funder principal sponsor
Declaration	element editionStmt { att.global.attributes, ( model.pLike+ \| ( edition, model.respLike* ) ) }
Example	<editionStmt> <edition n="S2">Students' edition</edition> <respStmt> <resp>Adapted by </resp> <name>Elizabeth Kirk</name> </respStmt> </editionStmt>
Example	<editionStmt> <p>First edition, <date>Michaelmas Term, 1991.</date> </p> </editionStmt>

<editor>

<editor> contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. [3.11.2.2. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.naming (@role, @nymRef) (att.canonical (@key, @ref))
Member of	model.respLike
Contained by	core: bibl header: editionStmt seriesStmt titleStmt
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element editor { att.global.attributes, att.naming.attributes, macro.phraseSeq }
Example	<editor>Eric Johnson</editor> <editor role="illustrator">John Tenniel</editor>
Note	A consistent format should be adopted. Particularly where cataloguing is likely to be based on the content of the header, it is advisable to use generally recognized authority lists for the exact form of personal names.

<editorialDecl>

<editorialDecl> (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. 2.3. 15.3.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	core: p
Declaration	element editorialDecl { att.global.attributes, att.declarable.attributes, ( model.pLike \| model.editorialDeclPart )+ }
Example	<editorialDecl> <normalization> <p>All words converted to Modern American spelling using Websters 9th Collegiate dictionary </p> </normalization> <quotation marks="all"> <p>All opening quotation marks converted to “ all closing quotation marks converted to &cdq;.</p> </quotation> </editorialDecl>

<eg>

<eg> (example) contains any kind of illustrative example. [22.4.4. 22.4.5. ]
Module	tagdocs
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.egLike
Contained by	core: add cit corr del desc emph head hi item l note orig p q ref reg sic stage title unclear figures: cell figDesc figure header: change licence linking: seg textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element eg { att.global.attributes, macro.phraseSeq }
Example	<p>The <gi>term</gi> element is declared using the following syntax: <eg><![CDATA[<!ELEMENT term (%phrase.content;)>]]</eg> </p>
Note	If the example contains material in XML markup, either it must be enclosed within a CDATA marked section, or character entity references must be used to represent the markup delimiters. If the example contains well-formed XML, it should be marked using the more specific <egXML> element.

<emph>

<emph> (emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect. [3.3.2.2. 3.3.2. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.emphLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element emph { att.global.attributes, macro.paraContent }
Example	You took the car and did <emph>what</emph>?!!
Example	<q>What it all comes to is this,</q> he said. <q> <emph>What does Christopher Robin do in the morning nowadays?</emph> </q>

<encodingDesc>

<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. [2.3. 2.1.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.teiHeaderPart
Contained by	header: teiHeader
May contain	core: p header: classDecl editorialDecl projectDesc refsDecl samplingDecl
Declaration	element encodingDesc { att.global.attributes, ( ( model.encodingDescPart \| model.pLike )+ ) }
Example	<encodingDesc> <p>Basic encoding, capturing lexical information only. All hyphenation, punctuation, and variant spellings normalized. No formatting or layout information preserved.</p> </encodingDesc>

<epigraph>

<epigraph> contains a quotation, anonymous or attributed, appearing at the start or end of a section or on a title page. [4.2.3. 4.2. 4.6. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divWrapper model.pLike.front model.titlepagePart
Contained by	core: lg list figures: figure table textstructure: back body div front group opener titlePage
May contain	analysis: interp interpGrp core: bibl cit desc gap index l label lb lg list listBibl milestone note p pb q sp stage figures: figure table linking: anchor tagdocs: eg
Declaration	element epigraph { att.global.attributes, ( model.common \| model.global )* }
Example	<epigraph xml:lang="la"> <cit> <bibl>Lucret.</bibl> <quote> <l part="F">petere inde coronam,</l> <l>Vnde prius nulli velarint tempora Musae.</l> </quote> </cit> </epigraph>

<expan>

<expan> (expansion) contains the expansion of an abbreviation. [3.5.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp))
Member of	model.choicePart model.pPart.editorial
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope choice corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element expan { att.global.attributes, att.editLike.attributes, macro.phraseSeq }
Example	The address is Southmoor <choice> <expan>Road</expan> <abbr>Rd</abbr> </choice>
Example	<expan xml:lang="la"> <abbr>Imp</abbr> <ex>erator</ex> </expan>
Note	The content of this element should usually be a complete word or phrase. The <ex> element provided by the transcr module may be used to mark up sequences of letters supplied within such an expansion.

<extent>

<extent> describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. 2.2. 3.11.2.4. 10.7.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.biblPart
Contained by	core: bibl header: fileDesc
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element extent { att.global.attributes, macro.phraseSeq }
Example	<extent>3200 sentences</extent> <extent>between 10 and 20 Mb</extent> <extent>ten 3.5 inch high density diskettes</extent>

<figDesc>

<figDesc> (description of figure) contains a brief prose description of the appearance or content of a graphic figure, for use when documenting an image without displaying it. [14.4. ]
Module	figures
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	figures: figure
May contain	core: abbr address bibl choice cit date desc emph expan foreign gloss hi label list listBibl mentioned name num ptr q ref rs soCalled stage term time title figures: table header: idno tagdocs: att code eg gi ident val
Declaration	element figDesc { att.global.attributes, macro.limitedContent }
Example	<figure> <graphic url="emblem1.png"/> <head>Emblemi d'Amore</head> <figDesc>A pair of naked winged cupids, each holding a flaming torch, in a rural setting.</figDesc> </figure>
Note	This element is intended for use as an alternative to the content of its parent figure element ; for example, to display when the image is required but the equipment in use cannot display graphic images. It may also be used for indexing or documentary purposes.

<figure>

<figure> groups elements representing or containing graphic information such as an illustration, formula, or figure. [14.4. ]
Module	figures
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.placement (@place) att.typed (@type, @subtype)
Member of	model.global
Contained by	analysis: s w core: abbr add addrLine address author bibl biblScope cit corr date del editor emph expan foreign gloss head hi item l label lg list mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled sp speaker stage term time title unclear figures: cell figure table header: authority change classCode distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: argument back body byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph front group imprimatur opener postscript salute signed text titlePage titlePart trailer
May contain	analysis: interp interpGrp core: bibl cit desc gap graphic head index l label lb lg list listBibl milestone note p pb q sp stage figures: figDesc figure formula table linking: anchor tagdocs: eg textstructure: argument byline closer dateline docAuthor docDate epigraph postscript salute signed trailer
Declaration	element figure { att.global.attributes, att.placement.attributes, att.typed.attributes, ( model.headLike \| model.common \| figDesc \| model.graphicLike \| model.global \| model.divBottom )* }
Example	<figure> <head>The View from the Bridge</head> <figDesc>A Whistleresque view showing four or five sailing boats in the foreground, and a series of buoys strung out between them.</figDesc> <graphic url="http://www.example.org/fig1.png" scale="0.5"/> </figure>

<fileDesc>

<fileDesc> (file description) contains a full bibliographic description of an electronic file. [2.2. 2.1.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: teiHeader
May contain	header: editionStmt extent notesStmt publicationStmt seriesStmt sourceDesc titleStmt
Declaration	element fileDesc { att.global.attributes, ( ( titleStmt, editionStmt?, extent?, publicationStmt, seriesStmt?, notesStmt? ), sourceDesc+ ) }
Example	<fileDesc> <titleStmt> <title>The shortest possible TEI document</title> </titleStmt> <publicationStmt> <p>Distributed as part of TEI P5</p> </publicationStmt> <sourceDesc> <p>No print source exists: this is an original digital text</p> </sourceDesc> </fileDesc>
Note	The major source of information for those seeking to create a catalogue entry or bibliographic citation for an electronic file. As such, it provides a title and statements of responsibility together with details of the publication or distribution of the file, of any series to which it belongs, and detailed bibliographic notes for matters not addressed elsewhere in the header. It also contains a full bibliographic description for the source or sources from which the electronic text was derived.

<foreign>

<foreign> (foreign) identifies a word or phrase as belonging to some language other than that of the surrounding text. [3.3.2.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.emphLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element foreign { att.global.attributes, macro.phraseSeq }
Example	This is heathen Greek to you still? Your <foreign xml:lang="la">lapis philosophicus</foreign>?
Note	The global xml:lang attribute should be supplied for this element to identify the language of the word or phrase marked. As elsewhere, its value should be a language tag as defined in 6.1. . This element is intended for use only where no other element is available to mark the phrase or words concerned. The global xml:lang attribute should be used in preference to this element where it is intended to mark the language of the whole of some text element. The <distinct> element may be used to identify phrases belonging to sublanguages or registers not generally regarded as true languages.

<formula>

<formula> contains a mathematical or other formula. [14.2. ]

Module figures

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

notation

names the notation used for the content of the element.

Status	Optional
Datatype	`data.enumerated`

Member of

model.graphicLike

Contained by

analysis: s

figures: cell figure formula table

header: change distributor edition extent licence

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

core: graphic hi

figures: formula

Declaration

element formula
{
   att.global.attributes,
   attribute notation { data.enumerated }?,
   ( text | model.graphicLike | model.hiLike )*
}

Example

Example

Example

<front>

<front> (front matter) contains any prefatory matter (headers, title page, prefaces, dedications, etc.) found at the start of a document, before the main body. [4.6. 4. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls)
Member of
Contained by	textstructure: text
May contain	analysis: interp interpGrp core: divGen gap head index lb milestone note p pb figures: figure linking: anchor textstructure: argument byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph postscript salute signed titlePage titlePart trailer
Declaration	element front { att.global.attributes, att.declaring.attributes, ( ( ( model.frontPart \| model.pLike \| model.pLike.front \| model.global )* ), ( ( ( model.div1Like, ( model.div1Like \| model.frontPart \| model.global )* ) \| ( model.divLike, ( model.divLike \| model.frontPart \| model.global )* ) ), ( ( ( model.divBottom ), ( model.divBottom \| model.global )* )? ) )? ) }
Example	<front> <epigraph> <quote>Nam Sibyllam quidem Cumis ego ipse oculis meis vidi in ampulla pendere, et cum illi pueri dicerent: <q xml:lang="gr">Σίβυλλα τί θέλεις</q>; respondebat illa: <q xml:lang="gr">ὰποθανεῖν θέλω.</q> </quote> </epigraph> <div type="dedication"> <p>For Ezra Pound <q xml:lang="it">il miglior fabbro.</q> </p> </div> </front>
Example	<front> <div type="dedication"> <p>To our three selves</p> </div> <div type="preface"> <head>Author's Note</head> <p>All the characters in this book are purely imaginary, and if the author has used names that may suggest a reference to living persons she has done so inadvertently. ...</p> </div> </front>

<funder>

<funder> (funding body) specifies the name of an individual, institution, or organization responsible for the funding of a project or text. [2.2.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.respLike
Contained by	core: bibl header: editionStmt titleStmt
May contain	analysis: interp interpGrp core: abbr address choice date emph expan foreign gap gloss hi index lb mentioned milestone name note num pb ptr ref rs soCalled term time title figures: figure header: idno linking: anchor tagdocs: att code gi ident val
Declaration	element funder { att.global.attributes, macro.phraseSeq.limited }
Example	<funder>The National Endowment for the Humanities, an independent federal agency</funder> <funder>Directorate General XIII of the Commission of the European Communities</funder> <funder>The Andrew W. Mellon Foundation</funder> <funder>The Social Sciences and Humanities Research Council of Canada</funder>
Note	Funders provide financial support for a project; they are distinct from sponsors, who provide intellectual support and authority.

<gap>

<gap> (gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible. [3.4.3. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp))

reason

gives the reason for omission. Sample values include sampling, inaudible, irrelevant, cancelled.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	any short indication of the reason for the omission.

hand

in the case of text omitted from the transcription because of deliberate deletion by an identifiable hand, signifies the hand which made the deletion.

Status	Optional
Datatype	`data.pointer`
Values	must point to a hand identifier typically but not necessarily declared in the document header (see section 11.3.2.1. ).

agent

In the case of text omitted because of damage, categorizes the cause of the damage, if it can be identified.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	rubbing damage results from rubbing of the leaf edges mildew damage results from mildew on the leaf surface smoke damage results from smoke

Member of

model.global.edit

Contained by

analysis: s w

figures: cell figure table

header: authority change classCode distributor edition extent funder language licence principal sponsor

linking: seg

tagdocs: eg

May contain

core: desc

Declaration

element gap
{
   att.global.attributes,
   att.editLike.attributes,
   attribute reason { list { data.word+ } }?,
   attribute hand { data.pointer }?,
   attribute agent { data.enumerated }?,
   ( model.descLike | model.certLike )*
}

Example

Example

Example

Example

Note

The gap, unclear, and del core tag elements may be closely allied in use with the <damage> and <supplied> elements, available when using the additional tagset for transcription of primary sources. See section 11.3.3.2. for discussion of which element is appropriate for which circumstance.

The gap tag simply signals the editors decision to omit or inability to transcribe a span of text. Other information, such as the interpretation that text was deliberately erased or covered, should be indicated using the relevant tags, such as del in the case of deliberate deletion.

<gi>

<gi> (element name) contains the name (generic identifier) of an element. [22. 22.4.4. ]

Module tagdocs

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

scheme

supplies the name of the scheme in which this name is defined.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	TEI this element is part of the TEI scheme. [Default] DBK (docbook) this element is part of the Docbook scheme. XX (unknown) this element is part of an unknown scheme. Schematron this element is from Schematron. HTML this element is from the HTML scheme.

Member of

model.phrase.xml

Contained by

analysis: s

figures: cell figDesc

header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain Character data only

Declaration

element gi
{
   att.global.attributes,
   attribute scheme { data.enumerated }?,
   data.name
}

Example

<p>The <gi>xhtml:li</gi> element is roughly analogous to the <gi>item</gi> element, as is the
<gi scheme="DBK">listItem</gi> element.</p>

This example shows the use of both a namespace prefix and the schema attribute as alternative ways of indicating that the gi in question is not a TEI element name: in practice only one method should be adopted.

<gloss>

<gloss> identifies a phrase or word used to provide a gloss or definition for some other word or phrase. [3.3.4. 22.4.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls) att.translatable (@versionDate) att.typed (@type, @subtype) att.pointing (@targetLang, @target, @evaluate) att.cReferencing (@cRef)
Member of	model.emphLike model.glossLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc category change classCode creation distributor edition extent funder language licence principal sponsor taxonomy linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element gloss { att.global.attributes, att.declaring.attributes, att.translatable.attributes, att.typed.attributes, att.pointing.attributes, att.cReferencing.attributes, macro.phraseSeq }
Example	We may define <term xml:id="tdpv" rend="sc">discoursal point of view</term> as <gloss target="#tdpv">the relationship, expressed through discourse structure, between the implied author or some other addresser, and the fiction.</gloss>
Note	The target and cRef attributes are mutually exclusive.

<graphic>

<graphic> indicates the location of an inline graphic, illustration, or figure. [3.9. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.media (@width, @height, @scale) att.resourced (@url) att.declaring (@decls)
Member of	model.graphicLike model.titlepagePart
Contained by	analysis: s core: abbr add addrLine author biblScope corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell figure formula table header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePage titlePart trailer
May contain	core: desc
Declaration	element graphic { att.global.attributes, att.media.attributes, att.resourced.attributes, att.declaring.attributes, model.descLike* }
Example	<figure> <graphic url="fig1.png"/> <head>Figure One: The View from the Bridge</head> <figDesc>A Whistleresque view showing four or five sailing boats in the foreground, and a series of buoys strung out between them.</figDesc> </figure>
Note	The mimeType attribute should be used to supply the MIME media type of the image specified by the url attribute.

<group>

<group> contains the body of a composite text, grouping together a sequence of distinct texts (or groups of such texts) which are regarded as a unit for some purpose, for example the collected works of an author, a sequence of prose essays, etc. [4. 4.3.1. 15.1. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls) att.typed (@type, @subtype)
Member of
Contained by	textstructure: group text
May contain	analysis: interp interpGrp core: gap head index lb milestone note pb figures: figure linking: anchor textstructure: argument byline closer dateline docAuthor docDate epigraph group opener postscript salute signed text trailer
Declaration	element group { att.global.attributes, att.declaring.attributes, att.typed.attributes, ( ( model.divTop \| model.global ), ( ( text \| group ), ( text \| group \| model.global ) ), model.divBottom* ) }
Example	<text> <!-- Section on Alexander Pope starts --> <front> <!-- biographical notice by editor --> </front> <group> <text> <!-- first poem --> </text> <text> <!-- second poem --> </text> </group> </text> <!-- end of Pope section-->

<head>

<head> (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc. [4.2.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)
Member of	model.headLike model.pLike.front
Contained by	core: divGen lg list listBibl figures: figure table textstructure: argument back body div front group postscript
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index l label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element head { att.global.attributes, att.typed.attributes, ( text \| lg \| model.gLike \| model.phrase \| model.inter \| model.lLike \| model.global )* }
Example	The most common use for the head element is to mark the headings of sections. In older writings, the headings or incipits may be rather longer than usual in modern works. If a section has an explicit ending as well as a heading, it should be marked as a trailer, as in this example: <div1 n="I" type="book"> <head>In the name of Christ here begins the first book of the ecclesiastical history of Georgius Florentinus, known as Gregory, Bishop of Tours.</head> <div2 type="section"> <head>In the name of Christ here begins Book I of the history.</head> <p>Proposing as I do ...</p> <p>From the Passion of our Lord until the death of Saint Martin four hundred and twelve years passed.</p> <trailer>Here ends the first Book, which covers five thousand, five hundred and ninety-six years from the beginning of the world down to the death of Saint Martin.</trailer> </div2> </div1>
Example	The head element is also used to mark headings of other units, such as lists: With a few exceptions, connectives are equally useful in all kinds of discourse: description, narration, exposition, argument. <list type="simple"> <head>Connectives</head> <item>above</item> <item>accordingly</item> <item>across from</item> <item>adjacent to</item> <item>again</item> <item> <!-- ... --> </item> </list>
Note	The head element is used for headings at all levels; software which treats (e.g.) chapter headings, section headings, and list titles differently must determine the proper processing of a head element based on its structural position. A head occurring as the first element of a list is the title of that list; one occurring as the first element of a <div1> is the title of that chapter or section.

<hi>

<hi> (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made. [3.3.2.2. 3.3.2. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.hiLike
Contained by	analysis: s w core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc formula header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element hi { att.global.attributes, macro.paraContent }
Example	<hi rend="gothic">And this Indenture further witnesseth</hi> that the said <hi rend="italic">Walter Shandy</hi>, merchant, in consideration of the said intended marriage ...

<ident>

<ident> (identifier) contains an identifier or name for an object of some kind in a formal language. ident is used for tokens such as variable names, class names, type names, function names etc. in formal programming languages. [22.1.1. ]
Module	tagdocs
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)
Member of	model.emphLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	Character data only
Declaration	element ident { att.global.attributes, att.typed.attributes, text }
Example	<ident type="namespace">http://www.tei-c.org/ns/Examples</ident>
Note	In running prose, this element may be used for any kind of identifier in any formal language. It should not be used for element and attribute names in XML, for which the special elements gi and att are provided.

<idno>

<idno> (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [2.2.4. 2.2.5. 3.11.2.4. ]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.sortable (@sortKey)

type

categorizes the identifier, for example as an ISBN, Social Security number, etc.

Status	Optional
Datatype	`data.enumerated`

Member of

model.nameLike model.publicationStmtPart

Contained by

analysis: s

figures: cell figDesc

header: authority catDesc change classCode creation distributor edition extent funder idno language licence principal publicationStmt seriesStmt sponsor

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

header: idno

Declaration

element idno
{
   att.global.attributes,
   att.sortable.attributes,
   attribute type { data.enumerated }?,
   ( text | model.gLike | idno )*
}

Example

<idno type="ISBN">978-1-906964-22-1</idno>
<idno type="ISSN">0143-3385</idno>
<idno type="DOI">http://dx.doi.org/10.1000/123</idno>
<idno type="URL">http://authority.nzetc.org/463/</idno>
<idno type="LT">Thomason Tract E.537(17)</idno>
<idno type="Wing">C695</idno>
<idno type="oldCat">
<g ref="#sym"/>345
</idno>

In the last case, the identifier includes a non-Unicode character which is defined elsewhere by means of a <glyph> or <char> element referenced here as #sym.

Note

idno should be used for labels which identify an object or concept in a formal cataloguing system such as a database or an RDF store, or in a distributed system such as the World Wide Web.

<imprimatur>

<imprimatur> contains a formal statement authorizing the publication of a work, sometimes required to appear on a title page or its verso. [4.6. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.titlepagePart
Contained by	textstructure: titlePage
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element imprimatur { att.global.attributes, macro.paraContent }
Example	<imprimatur>Licensed and entred acording to Order.</imprimatur>

<index>

<index> (index entry) marks a location to be indexed for whatever purpose. [3.8.2. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.spanning (@spanTo)

indexName

a single word which follows the rules defining a legal XML name (see http://www.w3.org/TR/REC-xml/#dt-name), supplying a name to specify which index (of several) the index entry belongs to.

Status	Optional
Datatype	`data.name`
Values	an application-specific name, consisting of Unicode characters only.
Note	This attribute makes it possible to create multiple indexes for a text.

Member of

model.global.meta

Contained by

analysis: s w

figures: cell figure table

header: authority change classCode distributor edition extent funder language licence principal sponsor

linking: seg

tagdocs: eg

May contain

core: index term

Declaration

element index
{
   att.global.attributes,
   att.spanning.attributes,
   attribute indexName { data.name }?,
   ( term, index? )*
}

Example

David's other principal backer, Josiah ha-Kohen
<index indexName="NAMES">
<term>Josiah ha-Kohen b. Azarya</term>
</index> b. Azarya, son of one of the last gaons of Sura <index indexName="PLACES">
<term>Sura</term>
</index> was David's own first cousin.

<interp>

<interp> (interpretation) summarizes a specific interpretative annotation which can be linked to a span of text. [17.3. ]
Module	analysis
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.interpLike (@type, @inst) (att.responsibility (@cert, @resp))
Member of	model.global.meta
Contained by	analysis: interpGrp s w core: abbr add addrLine address author bibl biblScope cit corr date del editor emph expan foreign gloss head hi item l label lg list mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled sp speaker stage term time title unclear figures: cell figure table header: authority change classCode distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: argument back body byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph front group imprimatur opener postscript salute signed text titlePage titlePart trailer
May contain	core: desc
Declaration	element interp { att.global.attributes, att.interpLike.attributes, ( text \| model.gLike \| model.descLike \| model.certLike )* }
Example	<interp type="structuralunit" xml:id="ana_am">aftermath</interp>
Note	Generally, each interp element carries an xml:id attribute. This permits the encoder to explicitly associate the interpretation represented by the content of an interp with any textual element through its ana attribute. Alternatively (or, in addition) an interp may carry an inst attribute which points to one or more textual elements to which the analysis represented by the content of the interp applies.

<interpGrp>

<interpGrp> (interpretation group) collects together a set of related interpretations which share responsibility or type. [17.3. ]
Module	analysis
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.interpLike (@type, @inst) (att.responsibility (@cert, @resp))
Member of	model.global.meta
Contained by	analysis: s w core: abbr add addrLine address author bibl biblScope cit corr date del editor emph expan foreign gloss head hi item l label lg list mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled sp speaker stage term time title unclear figures: cell figure table header: authority change classCode distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: argument back body byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph front group imprimatur opener postscript salute signed text titlePage titlePart trailer
May contain	analysis: interp core: desc
Declaration	element interpGrp { att.global.attributes, att.interpLike.attributes, ( model.descLike*, interp+ ) }
Example	<interpGrp resp="#TMA" type="structuralunit"> <desc>basic structural organization</desc> <interp xml:id="I1">introduction</interp> <interp xml:id="I2">conflict</interp> <interp xml:id="I3">climax</interp> <interp xml:id="I4">revenge</interp> <interp xml:id="I5">reconciliation</interp> <interp xml:id="I6">aftermath</interp> </interpGrp> <bibl xml:id="TMA"> <!-- bibliographic citation for source of this interpretive framework --> </bibl>
Note	Any number of interp elements.

<item>

<item> contains one component of a list. [3.7. 2.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.sortable (@sortKey)
Member of
Contained by	core: list
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index l label lb lg list listBibl mentioned milestone name note num orig p pb ptr q ref reg rs sic soCalled sp stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element item { att.global.attributes, att.sortable.attributes, macro.specialPara }
Example	<list type="ordered"> <head>Here begin the chapter headings of Book IV</head> <item n="4.1">The death of Queen Clotild.</item> <item n="4.2">How King Lothar wanted to appropriate one third of the Church revenues.</item> <item n="4.3">The wives and children of Lothar.</item> <item n="4.4">The Counts of the Bretons.</item> <item n="4.5">Saint Gall the Bishop.</item> <item n="4.6">The priest Cato.</item> <item> ...</item> </list>
Note	May contain simple prose or a sequence of chunks. Whatever string of characters is used to label a list item in the copy text may be used as the value of the global n attribute, but it is not required that numbering be recorded explicitly. In ordered lists, the n attribute on the item element is by definition synonymous with the use of the label element to record the enumerator of the list item. In glossary lists, however, the term being defined should be given with the label element, not n.

<keywords>

<keywords> contains a list of keywords or phrases identifying the topic or nature of a text. [2.4.3. ]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

scheme

identifies the controlled vocabulary within which the set of keywords concerned is defined.

Status	Optional
Datatype	`data.pointer`
Values	may point to a local definition, for example in a taxonomy element, or more usually to some external location where the scheme is fully defined.

Member of

Contained by

header: textClass

May contain

core: list term

Declaration

element keywords
{
   att.global.attributes,
   attribute scheme { data.pointer }?,
   ( term+ | list )
}

Example

<keywords scheme="http://classificationweb.net">
<term>Babbage, Charles</term>
<term>Mathematicians - Great Britain - Biography</term>
</keywords>

Example

<keywords>
<term>Fermented beverages</term>
<term>Central Andes</term>
<term>Schinus molle</term>
<term>Molle beer</term>
<term>Indigenous peoples</term>
<term>Ethnography</term>
<term>Archaeology</term>
</keywords>

Note

Each individual keyword (including compound subject headings) should be supplied as a term element directly within the keywords element. An alternative usage, in which each term appears within a item inside a list is permitted for backwards compatibility, but is deprecated.

If no control list exists for the keywords used, then no value should be supplied for the scheme attribute.

<l>

<l> (verse line) contains a single, possibly incomplete, line of verse. [3.12.1. 3.12. 7.2.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.fragmentable (@part)
Member of	model.lLike
Contained by	core: head item lg note q sp stage figures: cell figure header: change licence textstructure: argument body div epigraph postscript
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element l { att.global.attributes, att.fragmentable.attributes, macro.paraContent }
Example	<l met="x/x/x/x/x/" real="/xx/x/x/x/">Shall I compare thee to a summer's day?</l>

<label>

<label> contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary. [3.7. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype) att.placement (@place)
Member of	model.labelLike
Contained by	core: add corr del desc emph head hi item l lg list note orig p q ref reg sic stage title unclear figures: cell figDesc figure header: change licence linking: seg textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element label { att.global.attributes, att.typed.attributes, att.placement.attributes, macro.phraseSeq }
Example	Labels are commonly used for the headwords in glossary lists; note the use of the global xml:lang attribute to set the default language of the glossary list to Middle English, and identify the glosses and headings as modern English or Latin: <list type="gloss" xml:lang="enm"> <head xml:lang="en">Vocabulary</head> <headLabel xml:lang="en">Middle English</headLabel> <headItem xml:lang="en">New English</headItem> <label>nu</label> <item xml:lang="en">now</item> <label>lhude</label> <item xml:lang="en">loudly</item> <label>bloweth</label> <item xml:lang="en">blooms</item> <label>med</label> <item xml:lang="en">meadow</item> <label>wude</label> <item xml:lang="en">wood</item> <label>awe</label> <item xml:lang="en">ewe</item> <label>lhouth</label> <item xml:lang="en">lows</item> <label>sterteth</label> <item xml:lang="en">bounds, frisks (cf. <cit> <ref>Chaucer, K.T.644</ref> <quote>a courser, <term>sterting</term>as the fyr</quote> </cit> </item> <label>verteth</label> <item xml:lang="la">pedit</item> <label>murie</label> <item xml:lang="en">merrily</item> <label>swik</label> <item xml:lang="en">cease</item> <label>naver</label> <item xml:lang="en">never</item> </list>
Example	Labels may also be used to record explicitly the numbers or letters which mark list items in ordered lists, as in this extract from Gibbon's Autobiography. In this usage the label element is synonymous with the n attribute on the item element: I will add two facts, which have seldom occurred in the composition of six, or at least of five quartos. <list rend="runon" type="ordered"> <label>(1)</label> <item>My first rough manuscript, without any intermediate copy, has been sent to the press.</item> <label>(2) </label> <item>Not a sheet has been seen by any human eyes, excepting those of the author and the printer: the faults and the merits are exclusively my own.</item> </list>
Example	Labels may also be used for other structured list items, as in this extract from the journal of Edward Gibbon: <list type="gloss"> <label>March 1757.</label> <item>I wrote some critical observations upon Plautus.</item> <label>March 8th.</label> <item>I wrote a long dissertation upon some lines of Virgil.</item> <label>June.</label> <item>I saw Mademoiselle Curchod — <quote xml:lang="la">Omnia vincit amor, et nos cedamus amori.</quote> </item> <label>August.</label> <item>I went to Crassy, and staid two days.</item> </list> Note that the label might also appear within the item rather than as its sibling. Though syntactically valid, this usage is not recommended TEI practice.
Example	Labels may also be used to represent a label or heading attached to a paragraph or sequence of paragraphs not treated as a structural division, or to a group of verse lines. Note that, in this case, the label element appears within the p or lg element, rather than as a preceding sibling of it. <p>[...] <lb/>& n’entrer en mauuais & mal-heu- <lb/>ré meſnage. Or des que le conſente- <lb/>ment des parties y eſt le mariage eſt <lb/> arreſté, quoy que de faict il ne ſoit <label place="margin">Puiſſance maritale entre les Romains.</label> <lb/> conſommé. Depuis la conſomma- <lb/>tion du mariage la femme eſt ſoubs <lb/> la puiſſance du mary, s’il n’eſt eſcla- <lb/>ue ou enfant de famille : car en ce <lb/> cas, la femme, qui a eſpouſé vn en- <lb/>fant de famille, eſt ſous la puiſſance [...]</p> In this example the text of the label appears in the right hand margin of the original source, next to the paragraph it describes, but approximately in the middle of it. If so desired the type attribute may be used to distinguish different categories of label.

<language>

<language> characterizes a single language or sublanguage used within a text. [2.4.2. ]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

ident

(identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global xml:lang attribute.

Status	Required
Datatype	`data.language`

usage

specifies the approximate percentage (by volume) of the text which uses this language.

Status	Optional
Datatype	`xsd:nonNegativeInteger { maxInclusive = "100" }`

Member of

Contained by

header: langUsage

May contain

analysis: interp interpGrp

core: abbr address choice date emph expan foreign gap gloss hi index lb mentioned milestone name note num pb ptr ref rs soCalled term time title

figures: figure

header: idno

linking: anchor

tagdocs: att code gi ident val

Declaration

element language
{
   att.global.attributes,
   attribute ident { data.language },
   attribute usage { xsd:nonNegativeInteger { maxInclusive = "100" } }?,
   macro.phraseSeq.limited
}

Example

<langUsage>
<language ident="en-US" usage="75">modern American English</language>
<language ident="i-az-Arab" usage="20">Azerbaijani in Arabic script</language>
<language ident="x-lap" usage="05">Pig Latin</language>
</langUsage>

Note

Particularly for sublanguages, an informal prose characterization should be supplied as content for the element.

<langUsage>

<langUsage> (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. [2.4.2. 2.4. 15.3.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)
Member of	model.profileDescPart
Contained by	header: profileDesc
May contain	header: language
Declaration	element langUsage { att.global.attributes, att.declarable.attributes, language+ }
Example	<langUsage> <language ident="fr-CA" usage="60">Québecois</language> <language ident="en-CA" usage="20">Canadian business English</language> <language ident="en-GB" usage="20">British English</language> </langUsage>

<lb>

<lb> (line break) marks the start of a new (typographic) line in some edition or version of a text. [3.10.3. 7.2.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype) att.edition (@ed, @edRef) att.spanning (@spanTo) att.breaking (@break)
Member of	model.milestoneLike
Contained by	analysis: s w core: abbr add addrLine address author bibl biblScope cit corr date del editor emph expan foreign gloss head hi item l label lg list listBibl mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled sp speaker stage term time title unclear figures: cell figure table header: authority change classCode distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: argument back body byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph front group imprimatur opener postscript salute signed text titlePage titlePart trailer
May contain	Empty element
Declaration	element lb { att.global.attributes, att.typed.attributes, att.edition.attributes, att.spanning.attributes, att.breaking.attributes, empty }
Example	This example shows typographical line breaks within metrical lines, where they occur at different places in different editions: <l>Of Mans First Disobedience,<lb ed="1674"/> and<lb ed="1667"/> the Fruit</l> <l>Of that Forbidden Tree, whose<lb ed="1667 1674"/> mortal tast</l> <l>Brought Death into the World,<lb ed="1667"/> and all<lb ed="1674"/> our woe,</l>
Example	This example encodes typographical line breaks as a means of preserving the visual appearance of a title page. The break attribute is used to show that the line break does not (as elsewhere) mark the start of a new word. <titlePart> <lb/>With Additions, ne-<lb break="no"/>ver before Printed. </titlePart>
Note	By convention, lb elements should appear at the point in the text where a new line starts. The n attribute, if used, indicates the number or other value associated with the text between this point and the next lb element, typically the sequence number of the line within the page, or other appropriate unit. This element is intended to be used for marking actual line breaks on a manuscript or printed page, at the point where they occur; it should not be used to tag structural units such as lines of verse (for which the l element is available) except in circumstances where structural units cannot otherwise be marked. The type attribute may be used to characterize the line break in any respect. The more specialized attributes break, ed, or edRef should be preferred when the intent is to indicate whether or not the line break is word-breaking, or to note the source from which it derives.

<lg>

<lg> (line group) contains one or more verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc. [3.12.1. 3.12. 7.2.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.divLike (@org, @sample) (att.fragmentable (@part)) att.typed (@type, @subtype) att.declaring (@decls)
Member of	macro.paraContent model.divPart
Contained by	core: add corr del emph head hi item l lg note orig p q ref reg sic sp stage title unclear figures: cell figure header: change licence linking: seg textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart
May contain	analysis: interp interpGrp core: desc gap head index l label lb lg milestone note pb stage figures: figure linking: anchor textstructure: argument byline closer dateline docAuthor docDate epigraph opener postscript salute signed trailer
Declaration	element lg { att.global.attributes, att.divLike.attributes, att.typed.attributes, att.declaring.attributes, ( ( model.divTop \| model.global ), ( model.lLike \| model.stageLike \| model.labelLike \| lg ), ( model.lLike \| model.stageLike \| model.labelLike \| model.global \| lg ), ( ( model.divBottom ), model.global* )* ) }
Schematron	<sch:assert test="count(descendant::tei:lg\|descendant::tei:l\|descendant::tei:gap) > 0">An lg element must contain at least one child l, lg or gap element.</sch:assert>
Example	<lg type="free"> <l>Let me be my own fool</l> <l>of my own making, the sum of it</l> </lg> <lg type="free"> <l>is equivocal.</l> <l>One says of the drunken farmer:</l> </lg> <lg type="free"> <l>leave him lay off it. And this is</l> <l>the explanation.</l> </lg>
Note	contains verse lines or nested line groups only, possibly prefixed by a heading.

<licence>

<licence> contains information about a licence or other legal agreement applicable to the text. [2.2.4. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.pointing (@targetLang, @target, @evaluate) att.datable (@calendar, @period) (att.datable.w3c (@when))
Member of	model.availabilityPart
Contained by	header: availability
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index l label lb lg list listBibl mentioned milestone name note num orig p pb ptr q ref reg rs sic soCalled sp stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element licence { att.global.attributes, att.pointing.attributes, att.datable.attributes, macro.specialPara }
Example	<licence target="http://www.nzetc.org/tm/scholarly/tei-NZETC-Help.html#licensing"> Licence: Creative Commons Attribution-Share Alike 3.0 New Zealand Licence </licence>
Example	<availability> <licence target="http://creativecommons.org/licenses/by/3.0/" notBefore="2013-01-01"> <p>The Creative Commons Attribution 3.0 Unported (CC BY 3.0) Licence applies to this document.</p> <p>The licence was added on January 1, 2013.</p> </licence> </availability>
Note	A licence element should be supplied for each licence agreement applicable to the text in question. The target attribute may be used to reference a full version of the licence. The when, notBefore, notAfter, from or to attributes may be used in combination to indicate the date or dates of applicability of the licence.

<list>

<list> (list) contains any sequence of items organized as a list. [3.7. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.sortable (@sortKey)

type

describes the form of the list.

Status	Optional
Datatype	`data.enumerated`
Suggested values include:	ordered list items are numbered or lettered. bulleted list items are marked with a bullet or other typographic device. simple list items are not numbered or bulleted. [Default] gloss each list item glosses some term or concept, which is given by a label element preceding the list item.
Note	The formal syntax of the element declarations allows label tags to be omitted from lists tagged <list type="gloss">; this is however a semantic error.

Member of

model.listLike

Contained by

core: add corr del desc emph head hi item l note orig p q ref reg sic sp stage title unclear

figures: cell figDesc figure

header: change keywords licence revisionDesc sourceDesc

linking: seg

textstructure: argument back body div docEdition epigraph imprimatur postscript signed titlePart

May contain

analysis: interp interpGrp

core: gap head index item label lb milestone note pb

figures: figure

linking: anchor

textstructure: argument byline closer dateline docAuthor docDate epigraph opener postscript salute signed trailer

Declaration

element list
{
   att.global.attributes,
   att.sortable.attributes,
   attribute type { "ordered" | "bulleted" | "simple" | "gloss" | xsd:Name }?,
   (
      ( ( model.divTop ) | ( model.global ) )*,
      (
         ( item, model.global* )+
       | (
            headLabel?,
            headItem?,
            ( label, model.global*, item, model.global* )+
         )
      ),
      ( ( model.divBottom ), model.global* )*
   )
}

Example

<list type="ordered">
<item>a butcher</item>
<item>a baker</item>
<item>a candlestick maker, with <list type="bullets">
   <item>rings on his fingers</item>
   <item>bells on his toes</item>
  </list>
</item>
</list>

Example

The following example treats the short numbered clauses of Anglo-Saxon legal codes as lists of items. The text is from an ordinance of King Athelstan (924–939):

<div1 type="section">
<head>Athelstan's Ordinance</head>
<list type="ordered">
  <item n="1">Concerning thieves. First, that no thief is to be spared who is caught with
     the stolen goods, [if he is] over twelve years and [if the value of the goods is] over
     eightpence. <list type="ordered">
    <item n="1.1">And if anyone does spare one, he is to pay for the thief with his
         wergild — and the thief is to be no nearer a settlement on that account — or to
         clear himself by an oath of that amount.</item>
    <item n="1.2">If, however, he [the thief] wishes to defend himself or to escape, he is
         not to be spared [whether younger or older than twelve].</item>
    <item n="1.3">If a thief is put into prison, he is to be in prison 40 days, and he may
         then be redeemed with 120 shillings; and the kindred are to stand surety for him
         that he will desist for ever.</item>
    <item n="1.4">And if he steals after that, they are to pay for him with his wergild,
         or to bring him back there.</item>
    <item n="1.5">And if he steals after that, they are to pay for him with his wergild,
         whether to the king or to him to whom it rightly belongs; and everyone of those who
         supported him is to pay 120 shillings to the king as a fine.</item>
   </list>
  </item>
  <item n="2">Concerning lordless men. And we pronounced about these lordless men, from whom
     no justice can be obtained, that one should order their kindred to fetch back such a
     person to justice and to find him a lord in public meeting. <list type="ordered">
    <item n="2.1">And if they then will not, or cannot, produce him on that appointed day,
         he is then to be a fugitive afterwards, and he who encounters him is to strike him
         down as a thief.</item>
    <item n="2.2">And he who harbours him after that, is to pay for him with his wergild
         or to clear himself by an oath of that amount.</item>
   </list>
  </item>
  <item n="3">Concerning the refusal of justice. The lord who refuses justice and upholds
     his guilty man, so that the king is appealed to, is to repay the value of the goods and
     120 shillings to the king; and he who appeals to the king before he demands justice as
     often as he ought, is to pay the same fine as the other would have done, if he had
     refused him justice. <list type="ordered">
    <item n="3.1">And the lord who is an accessory to a theft by his slave, and it becomes
         known about him, is to forfeit the slave and be liable to his wergild on the first
         occasionp if he does it more often, he is to be liable to pay all that he owns.</item>
    <item n="3.2">And likewise any of the king's treasurers or of our reeves, who has been
         an accessory of thieves who have committed theft, is to liable to the same.</item>
   </list>
  </item>
  <item n="4">Concerning treachery to a lord. And we have pronounced concerning treachery to
     a lord, that he [who is accused] is to forfeit his life if he cannot deny it or is
     afterwards convicted at the three-fold ordeal.</item>
</list>
</div1>

Note that nested lists have been used so the tagging mirrors the structure indicated by the two-level numbering of the clauses. The clauses could have been treated as a one-level list with irregular numbering, if desired.

Example

<p>These decrees, most blessed Pope Hadrian, we propounded in the public council ... and they
confirmed them in our hand in your stead with the sign of the Holy Cross, and afterwards
inscribed with a careful pen on the paper of this page, affixing thus the sign of the Holy
Cross. <list type="simple">
  <item>I, Eanbald, by the grace of God archbishop of the holy church of York, have
     subscribed to the pious and catholic validity of this document with the sign of the Holy
     Cross.</item>
  <item>I, Ælfwold, king of the people across the Humber, consenting have subscribed with
     the sign of the Holy Cross.</item>
  <item>I, Tilberht, prelate of the church of Hexham, rejoicing have subscribed with the
     sign of the Holy Cross.</item>
  <item>I, Higbald, bishop of the church of Lindisfarne, obeying have subscribed with the
     sign of the Holy Cross.</item>
  <item>I, Ethelbert, bishop of Candida Casa, suppliant, have subscribed with thef sign of
     the Holy Cross.</item>
  <item>I, Ealdwulf, bishop of the church of Mayo, have subscribed with devout will.</item>
  <item>I, Æthelwine, bishop, have subscribed through delegates.</item>
  <item>I, Sicga, patrician, have subscribed with serene mind with the sign of the Holy
     Cross.</item>
</list>
</p>

Note

May contain an optional heading followed by a series of items, or a series of label and item pairs, the latter being optionally preceded by one or two specialized headings.

<listBibl>

<listBibl> (citation list) contains a list of bibliographic citations of any kind. [3.11.1. 2.2.7. 15.3.2. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.sortable (@sortKey) att.declarable (@default) att.typed (@type, @subtype)
Member of	model.biblLike
Contained by	core: add cit corr del desc emph head hi item l listBibl note orig p q ref reg relatedItem sic stage title unclear figures: cell figDesc figure header: change licence sourceDesc taxonomy linking: seg textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart
May contain	core: bibl head lb listBibl milestone pb linking: anchor
Declaration	element listBibl { att.global.attributes, att.sortable.attributes, att.declarable.attributes, att.typed.attributes, ( model.headLike*, ( model.biblLike \| model.milestoneLike )+ ) }
Example	<listBibl> <head>Works consulted</head> <bibl>Blain, Clements and Grundy: Feminist Companion to Literature in English (Yale, 1990) </bibl> <biblStruct> <analytic> <title>The Interesting story of the Children in the Wood</title> </analytic> <monogr> <title>The Penny Histories</title> <author>Victor E Neuberg</author> <imprint> <publisher>OUP</publisher> <date>1968</date> </imprint> </monogr> </biblStruct> </listBibl>

<mentioned>

<mentioned> marks words or phrases mentioned, not used. [3.3.3. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.emphLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element mentioned { att.global.attributes, macro.phraseSeq }
Example	There is thus a striking accentual difference between a verbal form like <mentioned xml:id="X234" xml:lang="el">eluthemen</mentioned> <gloss target="#X234">we were released,</gloss> accented on the second syllable of the word, and its participial derivative <mentioned xml:id="X235" xml:lang="el">lutheis</mentioned> <gloss target="#X235">released,</gloss> accented on the last.

<milestone>

<milestone> marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element. [3.10.3. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.milestoneUnit (@unit) att.typed (@type, @subtype) att.edition (@ed, @edRef) att.spanning (@spanTo) att.breaking (@break)
Member of	model.milestoneLike
Contained by	analysis: s w core: abbr add addrLine address author bibl biblScope cit corr date del editor emph expan foreign gloss head hi item l label lg list listBibl mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled sp speaker stage term time title unclear figures: cell figure table header: authority change classCode distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: argument back body byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph front group imprimatur opener postscript salute signed text titlePage titlePart trailer
May contain	Empty element
Declaration	element milestone { att.global.attributes, att.milestoneUnit.attributes, att.typed.attributes, att.edition.attributes, att.spanning.attributes, att.breaking.attributes, empty }
Example	<milestone n="23" ed="La" unit="Dreissiger"/> ... <milestone n="24" ed="AV" unit="verse"/> ...
Note	For this element, the global n attribute indicates the new number or other value for the unit which changes at this milestone. The special value unnumbered should be used in passages which fall outside the normal numbering scheme, such as chapter or other headings, poem numbers or titles, etc. The order in which milestone elements are given at a given point is not normally significant.

<name>

<name> (name, proper noun) contains a proper noun or noun phrase. [3.5.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.personal (@full, @sort) (att.naming (@role, @nymRef) (att.canonical (@key, @ref)) ) att.datable (@calendar, @period) (att.datable.w3c (@when)) att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp)) att.typed (@type, @subtype)
Member of	model.nameLike.agent
Contained by	analysis: s core: abbr add addrLine address author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp respStmt rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element name { att.global.attributes, att.personal.attributes, att.datable.attributes, att.editLike.attributes, att.typed.attributes, macro.phraseSeq }
Example	<name type="person">Thomas Hoccleve</name> <name type="place">Villingaholt</name> <name type="org">Vetus Latina Institut</name> <name type="person" ref="#HOC001">Occleve</name>
Note	Proper nouns referring to people, places, and organizations may be tagged instead with <persName>, <placeName>, or <orgName>, when the TEI module for names and dates is included.

<note>

<note> contains a note or annotation. [3.8.1. 2.2.6. 3.11.2.8. 9.3.5.4. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.placement (@place) att.pointing (@targetLang, @target, @evaluate) att.responsibility (@cert, @resp) att.typed (@type, @subtype)

anchored

indicates whether the copy text shows the exact place of reference for the note.

Status	Optional
Datatype	`data.truthValue`
Default	true
Note	In modern texts, notes are usually anchored by means of explicit footnote or endnote symbols. An explicit indication of the phrase or line annotated may however be used instead (e.g. ‘page 218, lines 3–4’). The anchored attribute indicates whether any explicit location is given, whether by symbol or by prose cross-reference. The value true indicates that such an explicit location is indicated in the copy text; the value false indicates that the copy text does not indicate a specific place of attachment for the note. If the specific symbols used in the copy text at the location the note is anchored are to be recorded, use the n attribute.

targetEnd

points to the end of the span to which the note is attached, if the note is not embedded in the text at that point.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	reference to the xml:id(s) of element(s) which end at the location(s) in question, or to an empty element at the point in question.
Note	This attribute is retained for backwards compatibility; it may be removed at a subsequent release of the Guidelines. The recommended way of pointing to a span of elements is by means of the range function of XPointer, as further described in 16.2.5.4. .

Member of

model.noteLike

Contained by

analysis: s w

figures: cell figure table

header: authority change classCode distributor edition extent funder language licence notesStmt principal sponsor

linking: seg

tagdocs: eg

May contain

analysis: interp interpGrp pc s w

figures: figure formula table

header: idno

linking: anchor seg

tagdocs: att code eg gi ident val

Declaration

element note
{
   att.global.attributes,
   att.placement.attributes,
   att.pointing.attributes,
   att.responsibility.attributes,
   att.typed.attributes,
   attribute anchored { data.truthValue }?,
   attribute targetEnd { list { data.pointer+ } }?,
   macro.specialPara
}

Example

In the following example, the translator has supplied a footnote containing an explanation of the term translated as "painterly":

And yet it is not only
in the great line of Italian renaissance art, but even in the
painterly <note place="bottom" type="gloss" resp="#MDMH">
<term xml:lang="de">Malerisch</term>. This word has, in the German, two
distinct meanings, one objective, a quality residing in the object,
the other subjective, a mode of apprehension and creation. To avoid
confusion, they have been distinguished in English as
<mentioned>picturesque</mentioned> and
<mentioned>painterly</mentioned> respectively.
</note> style of the
Dutch genre painters of the seventeenth century that drapery has this
psychological significance.

For this example to be valid, the code MDMH must be defined elsewhere, for example by means of a responsibility statement in the associated TEI header:

<respStmt xml:id="MDMH">
<resp>translation from German to English</resp>
<name>Hottinger, Marie Donald Mackie</name>
</respStmt>

Example

The global n attribute may be used to supply the symbol or number used to mark the note's point of attachment in the source text, as in the following example:

Mevorakh b. Saadya's mother, the matriarch of the
family during the second half of the eleventh century, <note n="126" anchored="true"> The
alleged mention of Judah Nagid's mother in a letter from 1071 is, in fact, a reference to
Judah's children; cf. above, nn. 111 and 54. </note> is well known from Geniza documents
published by Jacob Mann.

However, if notes are numbered in sequence and their numbering can be reconstructed automatically by processing software, it may well be considered unnecessary to record the note numbers.

<notesStmt>

<notesStmt> (notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description. [2.2.6. 2.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: fileDesc
May contain	core: note relatedItem
Declaration	element notesStmt { att.global.attributes, ( model.noteLike \| relatedItem )+ }
Example	<notesStmt> <note>Historical commentary provided by Mark Cohen</note> <note>OCR scanning done at University of Toronto</note> </notesStmt>
Note	Information of different kinds should not be grouped together into the same note.

<num>

<num> (number) contains a number, written in any form. [3.5.3. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.ranging (@atLeast, @atMost, @min, @max, @confidence)

type

indicates the type of numeric value.

Status	Optional
Datatype	`data.enumerated`
Suggested values include:	cardinal absolute number, e.g. 21, 21.5 ordinal ordinal number, e.g. 21st fraction fraction, e.g. one half or three-quarters percentage a percentage
Note	If a different typology is desired, other values can be used for this attribute.

value

supplies the value of the number in standard form.

Status	Optional
Datatype	`data.numeric`
Values	a numeric value.
Note	The standard form used is defined by the TEI datatype data.numeric.

Member of

model.measureLike

Contained by

analysis: s

figures: cell figDesc

header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

analysis: interp interpGrp pc s w

core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear

figures: figure formula

header: idno

linking: anchor seg

tagdocs: att code gi ident val

Declaration

element num
{
   att.global.attributes,
   att.ranging.attributes,
   attribute type
   {
      "cardinal" | "ordinal" | "fraction" | "percentage" | xsd:Name
   }?,
   attribute value { data.numeric }?,
   macro.phraseSeq
}

Example

<p>I reached <num type="cardinal" value="21">twenty-one</num> on
my <num type="ordinal" value="21">twenty-first</num> birthday</p>
<p>Light travels at <num value="3E10">3×10<hi rend="sup">10</hi>
</num> cm per second.</p>

Note

Detailed analyses of quantities and units of measure in historical documents may also use the feature structure mechanism described in chapter 18. . The num element is intended for use in simple applications.

<opener>

<opener> groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter. [4.2. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divTopPart
Contained by	core: lg list textstructure: body div group postscript
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val textstructure: argument byline dateline epigraph salute signed
Declaration	element opener { att.global.attributes, ( text \| model.gLike \| model.phrase \| argument \| byline \| dateline \| epigraph \| salute \| signed \| model.global )* }
Example	<opener> <dateline>Walden, this 29. of August 1592</dateline> </opener>
Example	<opener> <dateline> <name type="place">Great Marlborough Street</name> <date>November 11, 1848</date> </dateline> <salute>My dear Sir,</salute> </opener> <p>I am sorry to say that absence from town and other circumstances have prevented me from earlier enquiring...</p>

<orig>

<orig> (original form) contains a reading which is marked as following the original, rather than being normalized or corrected. [3.4.2. 12. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.responsibility (@cert, @resp)
Member of	model.choicePart model.pPart.transcriptional
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope choice corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element orig { att.global.attributes, att.responsibility.attributes, macro.paraContent }
Example	If all that is desired is to call attention to the original version in the copy text, orig may be used alone: <l>But this will be a <orig>meere</orig> confusion</l> <l>And hardly shall we all be <orig>vnderstoode</orig> </l>
Example	More usually, an orig will be combined with a regularized form within a choice element: <l>But this will be a <choice> <orig>meere</orig> <reg>mere</reg> </choice> confusion</l> <l>And hardly shall we all be <choice> <orig>vnderstoode</orig> <reg>understood</reg> </choice> </l>

<p>

<p> (paragraph) marks paragraphs in prose. [3.1. 7.2.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls) att.fragmentable (@part)
Member of	model.pLike
Contained by	core: item note q sp stage figures: cell figure header: availability change editionStmt editorialDecl encodingDesc licence projectDesc publicationStmt refsDecl samplingDecl seriesStmt sourceDesc textstructure: argument back body div epigraph front postscript
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element p { att.global.attributes, att.declaring.attributes, att.fragmentable.attributes, macro.paraContent }
Example	<p>Hallgerd was outside. <q>There is blood on your axe,</q> she said. <q>What have you done?</q> </p> <p> <q>I have now arranged that you can be married a second time,</q> replied Thjostolf. </p> <p> <q>Then you must mean that Thorvald is dead,</q> she said. </p> <p> <q>Yes,</q> said Thjostolf. <q>And now you must think up some plan for me.</q> </p>

<pb>

<pb> (page break) marks the start of a new page in a paginated document. [3.10.3. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype) att.edition (@ed, @edRef) att.spanning (@spanTo) att.breaking (@break)
Member of	model.milestoneLike
Contained by	analysis: s w core: abbr add addrLine address author bibl biblScope cit corr date del editor emph expan foreign gloss head hi item l label lg list listBibl mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled sp speaker stage term time title unclear figures: cell figure table header: authority change classCode distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: argument back body byline closer dateline div docAuthor docDate docEdition docImprint docTitle epigraph front group imprimatur opener postscript salute signed text titlePage titlePart trailer
May contain	Empty element
Declaration	element pb { att.global.attributes, att.typed.attributes, att.edition.attributes, att.spanning.attributes, att.breaking.attributes, empty }
Example	Page numbers may vary in different editions of a text. <p> ... <pb n="145" ed="ed2"/> <!-- Page 145 in edition "ed2" starts here --> ... <pb n="283" ed="ed1"/> <!-- Page 283 in edition "ed1" starts here--> ... </p>
Example	A page break may be associated with a facsimile image of the page it introduces by means of the facs attribute <body> <pb n="1" facs="page1.png"/> <!-- page1.png contains an image of the page; the text it contains is encoded here --> <p> <!-- ... --> </p> <pb n="2" facs="page2.png"/> <!-- similarly, for page 2 --> <p> <!-- ... --> </p> </body>
Note	A pb element should appear at the start of the page which it identifies. The global n attribute indicates the number or other value associated with this page. This will normally be the page number or signature printed on it, since the physical sequence number is implicit in the presence of the pb element itself. The type attribute may be used to characterize the page break in any respect, for example as word-breaking or not.

<pc>

<pc> (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark. [17.1. ]

Module analysis

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.segLike (@function) (att.fragmentable (@part)) att.typed (@type, @subtype)

force

indicates the extent to which this punctuation mark conventionally separates words or phrases

Status	Optional
Datatype	`data.enumerated`
Legal values are:	strong the punctuation mark is a word separator weak the punctuation mark is not a word separator inter the punctuation mark may or may not be a word separator

unit

provides a name for the kind of unit delimited by this punctuation mark.

Status	Optional
Datatype	`data.enumerated`

pre

indicates whether this punctuation mark precedes or follows the unit it delimits.

Status	Optional
Datatype	`data.truthValue`

Member of

model.segLike

Contained by

analysis: s w

figures: cell

header: change distributor edition extent licence

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

core: abbr add choice corr del expan orig reg sic unclear

Declaration

element pc
{
   att.global.attributes,
   att.segLike.attributes,
   att.typed.attributes,
   attribute force { "strong" | "weak" | "inter" }?,
   attribute unit { data.enumerated }?,
   attribute pre { data.truthValue }?,
   ( text | model.gLike | c | model.pPart.edit )*
}

Example

<phr>
<w>do</w>
<w>you</w>
<w>understand</w>
<pc type="interrogative">?</pc>
</phr>

<postscript>

<postscript> contains a postscript, e.g. to a letter. [4.2. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divBottomPart
Contained by	core: lg list figures: figure table textstructure: back body div front group postscript
May contain	analysis: interp interpGrp core: bibl cit desc gap head index l label lb lg list listBibl milestone note p pb q sp stage figures: figure table linking: anchor tagdocs: eg textstructure: closer opener postscript signed trailer
Declaration	element postscript { att.global.attributes, ( ( model.global \| model.divTopPart ), ( model.common ), ( model.global \| model.common ), ( ( model.divBottomPart ), model.global* )* ) }
Example	<div type="letter"> <opener> <dateline> <placeName>Rimaone</placeName> <date when="2006-11-21">21 Nov 06</date> </dateline> <salute>Dear Susan,</salute> </opener> <p>Thank you very much for the assistance splitting those logs. I'm sorry about the misunderstanding as to the size of the task. I really was not asking for help, only to borrow the axe. Hope you had fun in any case.</p> <closer> <salute>Sincerely yours,</salute> <signed>Seymour</signed> </closer> <postscript> <label>P.S.</label> <p>The collision occured on <date when="2001-07-06">06 Jul 01</date>.</p> </postscript> </div>

<principal>

<principal> (principal researcher) supplies the name of the principal researcher responsible for the creation of an electronic text. [2.2.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.respLike
Contained by	core: bibl header: editionStmt titleStmt
May contain	analysis: interp interpGrp core: abbr address choice date emph expan foreign gap gloss hi index lb mentioned milestone name note num pb ptr ref rs soCalled term time title figures: figure header: idno linking: anchor tagdocs: att code gi ident val
Declaration	element principal { att.global.attributes, macro.phraseSeq.limited }
Example	<principal>Gary Taylor</principal>

<profileDesc>

<profileDesc> (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting. [2.4. 2.1.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.teiHeaderPart
Contained by	header: teiHeader
May contain	header: creation langUsage textClass
Declaration	element profileDesc { att.global.attributes, ( model.profileDescPart* ) }
Example	<profileDesc> <langUsage> <language ident="fr">French</language> </langUsage> <textDesc n="novel"> <channel mode="w">print; part issues</channel> <constitution type="single"/> <derivation type="original"/> <domain type="art"/> <factuality type="fiction"/> <interaction type="none"/> <preparedness type="prepared"/> <purpose type="entertain" degree="high"/> <purpose type="inform" degree="medium"/> </textDesc> <settingDesc> <setting> <name>Paris, France</name> <time>Late 19th century</time> </setting> </settingDesc> </profileDesc>
Note	Although the content model permits it, it is rarely meaningful to supply multiple occurrences for any of the child elements of profileDesc. In earlier versions of these Guidelines, it was required that the creation element appear first.

<projectDesc>

<projectDesc> (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. 2.3. 15.3.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	core: p
Declaration	element projectDesc { att.global.attributes, att.declarable.attributes, model.pLike+ }
Example	<projectDesc> <p>Texts collected for use in the Claremont Shakespeare Clinic, June 1990</p> </projectDesc>

<ptr>

<ptr> (pointer) defines a pointer to another location. [3.6. 16.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.pointing (@targetLang, @target, @evaluate) att.typed (@type, @subtype) att.declaring (@decls) att.cReferencing (@cRef)
Member of	model.ptrLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope cit corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg relatedItem resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	Empty element
Declaration	element ptr { att.global.attributes, att.pointing.attributes, att.typed.attributes, att.declaring.attributes, att.cReferencing.attributes, empty }
Schematron	<s:report test="@target and @cRef">Only one of the attributes @target and @cRef may be supplied on <s:name/>.</s:report>
Example	<ptr target="#p143 #p144"/> <ptr target="http://www.tei-c.org"/> <ptr cRef="1.3.4"/>

<publicationStmt>

<publicationStmt> (publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. 2.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: fileDesc
May contain	core: address date p pubPlace publisher header: authority availability distributor idno
Declaration	element publicationStmt { att.global.attributes, ( model.pLike+ \| model.publicationStmtPart+ ) }
Example	<publicationStmt> <publisher>C. Muquardt </publisher> <pubPlace>Bruxelles & Leipzig</pubPlace> <date when="1846"/> </publicationStmt>
Example	<publicationStmt> <publisher>Chadwyck Healey</publisher> <pubPlace>Cambridge</pubPlace> <availability> <p>Available under licence only</p> </availability> <date when="1992">1992</date> </publicationStmt>
Note	Where a publication statement contains several members of the mode.publicationStmtPart class rather than one or more paragraphs, care should be taken to ensure that the repeated elements are presented in a meaningful order. It is a conformance requirement that elements supplying information about publication place, address, identifier, availability, and date be given following the name of the publisher, distributor, or authority concerned, and preferably in that order. These constraints are not currently modelled in the schema, but may be in a future release.

<publisher>

<publisher> provides the name of the organization responsible for the publication or distribution of a bibliographic item. [3.11.2.4. 2.2.4. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.imprintPart model.publicationStmtPart
Contained by	core: bibl header: publicationStmt textstructure: docImprint
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element publisher { att.global.attributes, macro.phraseSeq }
Example	<imprint> <pubPlace>Oxford</pubPlace> <publisher>Clarendon Press</publisher> <date>1987</date> </imprint>
Note	Use the full form of the name by which a company is usually referred to, rather than any abbreviation of it which may appear on a title page

<pubPlace>

<pubPlace> (publication place) contains the name of the place where a bibliographic item was published. [3.11.2.4. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.naming (@role, @nymRef) (att.canonical (@key, @ref))
Member of	model.imprintPart model.publicationStmtPart
Contained by	core: bibl header: publicationStmt textstructure: docImprint
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element pubPlace { att.global.attributes, att.naming.attributes, macro.phraseSeq }
Example	<publicationStmt> <publisher>Oxford University Press</publisher> <pubPlace>Oxford</pubPlace> <date>1989</date> </publicationStmt>

<q>

<q> (quoted) contains material which is distinguished from the surrounding text using quotation marks or a similar method, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used. [3.3.3. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.ascribed (@who) att.source (@source)

type

may be used to indicate whether the offset passage is spoken or thought, or to characterize it more finely.

Status	Optional
Datatype	`data.enumerated`
Suggested values include:	spoken representation of speech thought representation of thought, e.g. internal monologue written quotation from a written source soCalled authorial distance foreign distinct linguistically distinct term technical term emph rhetorically emphasized mentioned refering to itself, not its normal referant

Member of

model.qLike

Contained by

core: add cit corr del desc emph head hi item l note orig p q ref reg sic sp stage title unclear

figures: cell figDesc figure

header: change licence

linking: seg

textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart

May contain

analysis: interp interpGrp pc s w

figures: figure formula table

header: idno

linking: anchor seg

tagdocs: att code eg gi ident val

Declaration

element q
{
   att.global.attributes,
   att.ascribed.attributes,
   att.source.attributes,
   attribute type
   {
      "spoken"
    | "thought"
    | "written"
    | "soCalled"
    | "foreign"
    | "distinct"
    | "term"
    | "emph"
    | "mentioned"
    | xsd:Name
   }?,
   macro.specialPara
}

Example

It is spelled <q>Tübingen</q> — to enter the
letter <q>u</q> with an umlaut hold down the <q>option</q> key and press
<q>0 0 f c</q>

Note

May be used to indicate that a passage is distinguished from the surrounding text for reasons concerning which no claim is made. When used in this manner, q may be thought of as syntactic sugar for hi with a value of rend that indicates the use of such mechanisms as quotation marks.

<ref>

<ref> (reference) defines a reference to another location, possibly modified by additional text or comment. [3.6. 16.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.pointing (@targetLang, @target, @evaluate) att.typed (@type, @subtype) att.declaring (@decls) att.cReferencing (@cRef)
Member of	model.ptrLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope cit corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg relatedItem resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element ref { att.global.attributes, att.pointing.attributes, att.typed.attributes, att.declaring.attributes, att.cReferencing.attributes, macro.paraContent }
Schematron	<s:report test="@target and @cRef">Only one of the attributes @target' and @cRef' may be supplied on <s:name/> </s:report>
Example	See especially <ref target="http://www.natcorp.ox.ac.uk/Texts/A02.xml#s2">the second sentence</ref>
Example	See also <ref target="#locution">s.v. <term>locution</term> </ref>.
Note	The target and cRef attributes are mutually exclusive.

<refsDecl>

<refsDecl> (references declaration) specifies how canonical references are constructed for this text. [2.3.6.3. 2.3. 2.3.6. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	core: p
Declaration	element refsDecl { att.global.attributes, att.declarable.attributes, ( model.pLike+ \| cRefPattern+ \| refState+ ) }
Example	<refsDecl> <cRefPattern matchPattern="([A-Za-z0-9]+) ([0-9]+):([0-9]+)" replacementPattern="#xpath(//body/div[@n='$1']/div[$2]/div3[$3])"/> </refsDecl> This example is a formal representation for the referencing scheme described informally in the following example.
Example	<refsDecl> <p>References are made up by concatenating the value for the <att>n</att> attribute on the highest level <gi>div</gi> element, followed by a space, followed by the sequential number of the next level <gi>div</gi> followed by a colon followed by the sequential number of the next (and lowest) level <gi>div</gi>.</p> </refsDecl>

<reg>

<reg> (regularization) contains a reading which has been regularized or normalized in some sense. [3.4.2. 12. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp)) att.typed (@type, @subtype)
Member of	model.choicePart model.pPart.transcriptional
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope choice corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element reg { att.global.attributes, att.editLike.attributes, att.typed.attributes, macro.paraContent }
Example	If all that is desired is to call attention to the fact that the copy text has been regularized, reg may be used alone: <q>Please <reg>knock</reg> if an <reg>answer</reg> is <reg>required</reg> </q>
Example	It is also possible to identify the individual responsible for the regularization, and, using the choice and orig elements, to provide both the original and regularized readings: <q>Please <choice> <reg resp="#LB">knock</reg> <orig>cnk</orig> </choice> if an <choice> <reg>answer</reg> <orig>nsr</orig> </choice> is <choice> <reg>required</reg> <orig>reqd</orig> </choice> </q>

<relatedItem>

<relatedItem> contains or references some other bibliographic item which is related to the present one in some specified manner, for example as a constituent or alternative version of it. [3.11.2.7. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)

target

points to the related bibliographic element by means of an absolute or relative URI reference

Status	Optional
Datatype	`data.pointer`

Member of

model.biblPart

Contained by

core: bibl

header: notesStmt

May contain

core: bibl listBibl ptr ref

Declaration

element relatedItem
{
   att.global.attributes,
   att.typed.attributes,
   attribute target { data.pointer }?,
   ( model.biblLike | model.ptrLike )?
}

Schematron

<sch:report test="@target and count( child::* ) > 0">If the @target attribute on <sch:name/> is used, the
relatedItem element must be empty</sch:report>
<sch:assert test="@target or child::*">A relatedItem element should have either a 'target' attribute
or a child element to indicate the related bibliographic item</sch:assert>

Example

<biblStruct>
<monogr>
  <author>Shirley, James</author>
  <title type="main">The gentlemen of Venice</title>
  <imprint>
   <pubPlace>New York</pubPlace>
   <publisher>Readex Microprint</publisher>
   <date>1953</date>
  </imprint>
  <extent>1 microprint card, 23 x 15 cm.</extent>
</monogr>
<series>
  <title>Three centuries of drama: English, 1642–1700</title>
</series>
<relatedItem type="otherForm">
  <biblStruct>
   <monogr>
    <author>Shirley, James</author>
    <title type="main">The gentlemen of Venice</title>
    <title type="sub">a tragi-comedie presented at the private house in Salisbury
         Court by Her Majesties servants</title>
    <imprint>
     <pubPlace>London</pubPlace>
     <publisher>H. Moseley</publisher>
     <date>1655</date>
    </imprint>
    <extent>78 p.</extent>
   </monogr>
  </biblStruct>
</relatedItem>
</biblStruct>

Note

If the target attribute is used to reference the related bibliographic item, the element should be empty.

<resp>

<resp> (responsibility) contains a phrase describing the nature of a person's intellectual responsibility, or an organization's role in the production or distribution of a work. [3.11.2.2. 2.2.1. 2.2.2. 2.2.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.canonical (@key, @ref)
Member of
Contained by	core: respStmt
May contain	analysis: interp interpGrp core: abbr address choice date emph expan foreign gap gloss hi index lb mentioned milestone name note num pb ptr ref rs soCalled term time title figures: figure header: idno linking: anchor tagdocs: att code gi ident val
Declaration	element resp { att.global.attributes, att.canonical.attributes, macro.phraseSeq.limited }
Example	<respStmt> <resp key="com">compiler</resp> <name>Edward Child</name> </respStmt>
Note	The attributes key or ref, inherited from the class att.canonical may be used to indicate the kind of responsibility in a normalized form, by referring directly (using ref) or indirectly (using key) to a standardized list of responsibility types, such as that maintained by a naming authority, for example the list maintained at http://www.loc.gov/marc/relators/relacode.html for bibliographic usage.

<respStmt>

<respStmt> (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work. [3.11.2.2. 2.2.1. 2.2.2. 2.2.5. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.respLike
Contained by	core: bibl header: editionStmt seriesStmt titleStmt
May contain	core: name resp
Declaration	element respStmt { att.global.attributes, ( ( resp+, model.nameLike.agent+ ) \| ( model.nameLike.agent+, resp+ ) ) }
Example	<respStmt> <resp>transcribed from original ms</resp> <persName>Claus Huitfeldt</persName> </respStmt>
Example	<respStmt> <resp>converted to XML encoding</resp> <name>Alan Morrison</name> </respStmt>

<revisionDesc>

<revisionDesc> (revision description) summarizes the revision history for a file. [2.5. 2.1.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: teiHeader
May contain	core: list header: change
Declaration	element revisionDesc { att.global.attributes, ( list \| listChange \| change+ ) }
Example	<revisionDesc status="embargoed"> <change when="1991-11-11" who="#LB"> deleted chapter 10 </change> </revisionDesc>
Note	If present on this element, the status attribute should indicate the current status of the document. The same attribute may appear on any change to record the status at the time of that change. Conventionally change elements should be given in reverse date order, with the most recent change at the start of the list.

<row>

<row> contains one row of a table. [14.1.1. ]
Module	figures
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.tableDecoration (@role, @rows, @cols)
Member of
Contained by	figures: table
May contain	figures: cell
Declaration	element row { att.global.attributes, att.tableDecoration.attributes, cell+ }
Example	<row role="data"> <cell role="label">Classics</cell> <cell>Idle listless and unimproving</cell> </row>

<rs>

<rs> (referencing string) contains a general purpose name or referring string. [13.2.1. 3.5.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.naming (@role, @nymRef) (att.canonical (@key, @ref)) att.typed (@type, @subtype)
Member of	model.nameLike
Contained by	analysis: s core: abbr add addrLine address author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element rs { att.global.attributes, att.naming.attributes, att.typed.attributes, macro.phraseSeq }
Example	<q>My dear <rs type="person">Mr. Bennet</rs>, </q> said <rs type="person">his lady</rs> to him one day, <q>have you heard that <rs type="place">Netherfield Park</rs> is let at last?</q>

<s>

<s> (s-unit) contains a sentence-like division of a text. [17.1. 8.4.1. ]
Module	analysis
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.segLike (@function) (att.fragmentable (@part)) att.typed (@type, @subtype)
Member of	model.segLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element s { att.global.attributes, att.segLike.attributes, att.typed.attributes, macro.phraseSeq }
Schematron	<s:report test="tei:s">You may not nest one s element within another: use seg instead</s:report>
Example	<head> <s>A short affair</s> </head> <s>When are you leaving?</s> <s>Tomorrow.</s>
Note	The s element may be used to mark orthographic sentences, or any other segmentation of a text, provided that the segmentation is end-to-end, complete, and non-nesting. For segmentation which is partial or recursive, the seg should be used instead. The type attribute may be used to indicate the type of segmentation intended, according to any convenient typology.

<salute>

<salute> (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc. [4.2.2. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divWrapper
Contained by	core: lg list figures: figure table textstructure: body closer div front group opener
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element salute { att.global.attributes, macro.phraseSeq }
Example	<salute>To all courteous mindes, that will voutchsafe the readinge.</salute>

<samplingDecl>

<samplingDecl> (sampling declaration) contains a prose description of the rationale and methods used in sampling texts in the creation of a corpus or collection. [2.3.2. 2.3. 15.3.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)
Member of	model.encodingDescPart
Contained by	header: encodingDesc
May contain	core: p
Declaration	element samplingDecl { att.global.attributes, att.declarable.attributes, model.pLike+ }
Example	<samplingDecl> <p>Samples of up to 2000 words taken at random from the beginning, middle, or end of each text identified as relevant by respondents.</p> </samplingDecl>
Note	This element records all information about systematic inclusion or omission of portions of the text, whether a reflection of sampling procedures in the pure sense or of systematic omission of material deemed either too difficult to transcribe or not of sufficient interest.

<seg>

<seg> (arbitrary segment) represents any segmentation of text below the ‘chunk’ level. [16.3. 6.2. 7.2.5. ]
Module	linking
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.segLike (@function) (att.fragmentable (@part)) att.typed (@type, @subtype) att.responsibility (@cert, @resp)
Member of	model.choicePart model.segLike
Contained by	analysis: s w core: abbr add addrLine author bibl biblScope choice corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element seg { att.global.attributes, att.segLike.attributes, att.typed.attributes, att.responsibility.attributes, macro.paraContent }
Example	<seg>When are you leaving?</seg> <seg>Tomorrow.</seg>
Example	<s> <seg rend="caps" type="initial-cap">So father's only</seg> glory was the ballfield. </s>
Example	<seg type="preamble"> <seg>Sigmund, <seg type="patronym">the son of Volsung</seg>, was a king in Frankish country.</seg> <seg>Sinfiotli was the eldest of his sons ...</seg> <seg>Borghild, Sigmund's wife, had a brother ... </seg> </seg>
Note	The seg element may be used at the encoder's discretion to mark any segments of the text of interest for processing. One use of the element is to mark text features for which no appropriate markup is otherwise defined. Another use is to provide an identifier for some segment which is to be pointed at by some other element—i.e. to provide a target, or a part of a target, for a ptr or other similar element.

<seriesStmt>

<seriesStmt> (series statement) groups information about the series, if any, to which a publication belongs. [2.2.5. 2.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: fileDesc
May contain	core: biblScope editor p respStmt title header: idno
Declaration	element seriesStmt { att.global.attributes, ( model.pLike+ \| ( title+, ( editor \| respStmt ), ( idno \| biblScope ) ) ) }
Example	<seriesStmt> <title>Machine-Readable Texts for the Study of Indian Literature</title> <respStmt> <resp>ed. by</resp> <name>Jan Gonda</name> </respStmt> <biblScope unit="volume">1.2</biblScope> <idno type="ISSN">0 345 6789</idno> </seriesStmt>

<sic>

<sic> (Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate. [3.4.1. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.responsibility (@cert, @resp)
Member of	model.choicePart model.pPart.transcriptional
Contained by	analysis: pc s w core: abbr add addrLine author bibl biblScope choice corr date del editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg rs sic soCalled speaker stage term time title unclear figures: cell header: change distributor edition extent licence linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element sic { att.global.attributes, att.responsibility.attributes, macro.paraContent }
Example	for his nose was as sharp as a pen, and <sic>a Table</sic> of green fields.
Example	If all that is desired is to call attention to the apparent problem in the copy text, sic may be used alone: I don't know, Juan. It's so far in the past now — how <sic>we can</sic> prove or disprove anyone's theories?
Example	It is also possible, using the choice and corr elements, to provide a corrected reading: I don't know, Juan. It's so far in the past now — how <choice> <sic>we can</sic> <corr>can we</corr> </choice> prove or disprove anyone's theories?
Example	for his nose was as sharp as a pen, and <choice> <sic>a Table</sic> <corr>a' babbld</corr> </choice> of green fields.

<signed>

<signed> (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text. [4.2.2. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.divBottomPart
Contained by	core: lg list figures: figure table textstructure: back body closer div front group opener postscript
May contain	analysis: interp interpGrp pc s w core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear figures: figure formula table header: idno linking: anchor seg tagdocs: att code eg gi ident val
Declaration	element signed { att.global.attributes, macro.paraContent }
Example	<signed>Thine to command <name>Humph. Moseley</name> </signed>
Example	<closer> <signed>Sign'd and Seal'd, <list> <item>John Bull,</item> <item>Nic. Frog.</item> </list> </signed> </closer>

<soCalled>

<soCalled> contains a word or phrase for which the author or narrator indicates a disclaiming of responsibility, for example by the use of scare quotes or italics. [3.3.3. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.emphLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element soCalled { att.global.attributes, macro.phraseSeq }
Example	To edge his way along the crowded paths of life, warning all human sympathy to keep its distance, was what the knowing ones call <soCalled>nuts</soCalled> to Scrooge.

<sourceDesc>

<sourceDesc> (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. [2.2.7. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)
Member of
Contained by	header: fileDesc
May contain	core: bibl list listBibl p figures: table
Declaration	element sourceDesc { att.global.attributes, att.declarable.attributes, ( model.pLike+ \| ( model.biblLike \| model.sourceDescPart \| model.listLike )+ ) }
Example	<sourceDesc> <bibl> <title level="a">The Interesting story of the Children in the Wood</title>. In <author>Victor E Neuberg</author>, <title>The Penny Histories</title>. <publisher>OUP</publisher> <date>1968</date>. </bibl> </sourceDesc>
Example	<sourceDesc> <p>Born digital: no previous source exists.</p> </sourceDesc>

<sp>

<sp> (speech) contains an individual speech in a performance text, or a passage presented as such in a prose or verse text. [3.12.2. 3.12. 7.2.2. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.ascribed (@who)
Member of	model.divPart
Contained by	core: item note q stage figures: cell figure header: change licence textstructure: argument body div epigraph postscript
May contain	analysis: interp interpGrp core: cit gap index l lb lg list milestone note p pb q speaker stage figures: figure table linking: anchor
Declaration	element sp { att.global.attributes, att.ascribed.attributes, ( model.global, ( speaker, model.global )?, ( ( lg \| model.lLike \| model.pLike \| model.listLike \| model.stageLike \| model.qLike ), model.global* )+ ) }
Example	<sp> <speaker>The reverend Doctor Opimian</speaker> <p>I do not think I have named a single unpresentable fish.</p> </sp> <sp> <speaker>Mr Gryll</speaker> <p>Bream, Doctor: there is not much to be said for bream.</p> </sp> <sp> <speaker>The Reverend Doctor Opimian</speaker> <p>On the contrary, sir, I think there is much to be said for him. In the first place....</p> <p>Fish, Miss Gryll — I could discourse to you on fish by the hour: but for the present I will forbear...</p> </sp>
Note	The who attribute on this element may be used either in addition to the speaker element or as an alternative.
Note	Lines or paragraphs, stage directions, and phrase-level elements. The who attribute on this element may be used either in addition to the speaker element or as an alternative.

<speaker>

<speaker> contains a specialized form of heading or label, giving the name of one or more speakers in a dramatic text or fragment. [3.12.2. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	core: sp
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element speaker { att.global.attributes, macro.phraseSeq }
Example	<sp who="#ni #rsa"> <speaker>Nancy and Robert</speaker> <stage type="delivery">(speaking simultaneously)</stage> <p>The future? ...</p> </sp> <list type="speakers"> <item xml:id="ni"/> <item xml:id="rsa"/> </list>

<sponsor> specifies the name of a sponsoring organization or institution. [2.2.1. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.respLike
Contained by	core: bibl header: editionStmt titleStmt
May contain	analysis: interp interpGrp core: abbr address choice date emph expan foreign gap gloss hi index lb mentioned milestone name note num pb ptr ref rs soCalled term time title figures: figure header: idno linking: anchor tagdocs: att code gi ident val
Declaration	element sponsor { att.global.attributes, macro.phraseSeq.limited }
Example	<sponsor>Association for Computers and the Humanities</sponsor> <sponsor>Association for Computational Linguistics</sponsor> <sponsor>Association for Literary and Linguistic Computing</sponsor>
Note	Sponsors give their intellectual authority to a project; they are to be distinguished from funders, who provide the funding but do not necessarily take intellectual responsibility.

<stage>

<stage> (stage direction) contains any kind of stage direction within a dramatic text or fragment. [3.12.2. 3.12. 7.2.4. ]

Module core

Attributes

Attributes att.ascribed (@who) att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

type

indicates the kind of stage direction.

Status	Recommended
Datatype	`data.enumerated`
Suggested values include:	setting describes a setting. entrance describes an entrance. exit describes an exit. business describes stage business. novelistic is a narrative, motivating stage direction. delivery describes how a character speaks. modifier gives some detail about a character. location describes a location. mixed more than one of the above

Member of

model.stageLike

Contained by

core: add corr del desc emph head hi item l lg note orig p q ref reg sic sp stage title unclear

figures: cell figDesc figure

header: change licence

linking: seg

textstructure: argument body div docEdition epigraph imprimatur postscript signed titlePart

May contain

analysis: interp interpGrp pc s w

figures: figure formula table

header: idno

linking: anchor seg

tagdocs: att code eg gi ident val

Declaration

element stage
{
   att.ascribed.attributes,
   att.global.attributes,
   attribute type
   {
      "setting"
    | "entrance"
    | "exit"
    | "business"
    | "novelistic"
    | "delivery"
    | "modifier"
    | "location"
    | "mixed"
    | xsd:Name
   }?,
   macro.specialPara
}

Example

<stage type="setting">A curtain being drawn.</stage>
<stage type="setting">Music</stage>
<stage type="entrance">Enter Husband as being thrown off his horse.</stage>
<stage type="exit">Exit pursued by a bear.</stage>
<stage type="business">He quickly takes the stone out.</stage>
<stage type="delivery">To Lussurioso.</stage>
<stage type="novelistic">Having had enough, and embarrassed for the family.</stage>
<stage type="modifier">Disguised as Ansaldo.</stage>
<stage type="location">At a window.</stage>
<stage rend="inline" type="delivery">Aside.</stage>

Note

The who attribute may be used to indicate more precisely the person or persons participating in the action described by the stage direction.

<table>

<table> contains text displayed in tabular form, in rows and columns. [14.1.1. ]

Module figures

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

rows

indicates the number of rows in the table.

Status	Optional
Datatype	`data.count`
Values	If no number is supplied, an application must calculate the number of rows.
Note	Rows should be presented from top to bottom.

cols

(columns) indicates the number of columns in each row of the table.

Status	Optional
Datatype	`data.count`
Values	If no number is supplied, an application must calculate the number of columns.
Note	Within each row, columns should be presented left to right.

Member of

model.listLike

Contained by

core: add corr del desc emph head hi item l note orig p q ref reg sic sp stage title unclear

figures: cell figDesc figure

header: change licence sourceDesc

linking: seg

textstructure: argument back body div docEdition epigraph imprimatur postscript signed titlePart

May contain

analysis: interp interpGrp

core: gap graphic head index lb milestone note pb

figures: figure formula row

linking: anchor

textstructure: argument byline closer dateline docAuthor docDate epigraph postscript salute signed trailer

Declaration

element table
{
   att.global.attributes,
   attribute rows { data.count }?,
   attribute cols { data.count }?,
   (
      ( model.headLike | model.global )*,
      ( ( row, model.global* )+ | ( ( model.graphicLike ), model.global* )+ ),
      ( ( model.divBottom ), model.global* )*
   )
}

Example

<table rows="4" cols="4">
<head>Poor Men's Lodgings in Norfolk (Mayhew, 1843)</head>
<row role="label">
  <cell role="data"/>
  <cell role="data">Dossing Cribs or Lodging Houses</cell>
  <cell role="data">Beds</cell>
  <cell role="data">Needys or Nightly Lodgers</cell>
</row>
<row role="data">
  <cell role="label">Bury St Edmund's</cell>
  <cell role="data">5</cell>
  <cell role="data">8</cell>
  <cell role="data">128</cell>
</row>
<row role="data">
  <cell role="label">Thetford</cell>
  <cell role="data">3</cell>
  <cell role="data">6</cell>
  <cell role="data">36</cell>
</row>
<row role="data">
  <cell role="label">Attleboro'</cell>
  <cell role="data">3</cell>
  <cell role="data">5</cell>
  <cell role="data">20</cell>
</row>
<row role="data">
  <cell role="label">Wymondham</cell>
  <cell role="data">1</cell>
  <cell role="data">11</cell>
  <cell role="data">22</cell>
</row>
</table>

Note

Contains an optional heading and a series of rows.

Any rendition information should be supplied using the global rend attribute, at the table, row, or cell level as appropriate.

<taxonomy>

<taxonomy> defines a typology either implicitly, by means of a bibliographic citation, or explicitly by a structured taxonomy. [2.3.7. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: classDecl
May contain	core: bibl desc gloss listBibl header: category
Declaration	element taxonomy { att.global.attributes, ( ( model.glossLike \| model.descLike )* \| category+ \| ( ( model.biblLike ), category* ) ) }
Example	<taxonomy xml:id="tax.b"> <bibl>Brown Corpus</bibl> <category xml:id="tax.b.a"> <catDesc>Press Reportage</catDesc> <category xml:id="tax.b.a1"> <catDesc>Daily</catDesc> </category> <category xml:id="tax.b.a2"> <catDesc>Sunday</catDesc> </category> <category xml:id="tax.b.a3"> <catDesc>National</catDesc> </category> <category xml:id="tax.b.a4"> <catDesc>Provincial</catDesc> </category> <category xml:id="tax.b.a5"> <catDesc>Political</catDesc> </category> <category xml:id="tax.b.a6"> <catDesc>Sports</catDesc> </category> </category> <category xml:id="tax.b.d"> <catDesc>Religion</catDesc> <category xml:id="tax.b.d1"> <catDesc>Books</catDesc> </category> <category xml:id="tax.b.d2"> <catDesc>Periodicals and tracts</catDesc> </category> </category> </taxonomy>

<TEI>

<TEI> (TEI document) contains a single TEI-conformant document, comprising a TEI header and a text, either in isolation or as part of a teiCorpus element. [4. 15.1. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	core: teiCorpus
May contain	header: teiHeader textstructure: text
Declaration	element TEI { att.global.attributes, ( teiHeader, ( ( model.resourceLike+, text? ) \| text ) ) }
Schematron	<s:ns prefix="tei" uri="http://www.tei-c.org/ns/1.0"/>
Schematron	<s:ns prefix="rng" uri="http://relaxng.org/ns/structure/1.0"/>
Example	<TEI version="5.0" xmlns="http://www.tei-c.org/ns/1.0"> <teiHeader> <fileDesc> <titleStmt> <title>The shortest TEI Document Imaginable</title> </titleStmt> <publicationStmt> <p>First published as part of TEI P2, this is the P5 version using a name space.</p> </publicationStmt> <sourceDesc> <p>No source: this is an original work.</p> </sourceDesc> </fileDesc> </teiHeader> <text> <body> <p>This is about the shortest TEI document imaginable.</p> </body> </text> </TEI>
Note	This element is required.

<teiCorpus>

<teiCorpus> contains the whole of a TEI encoded corpus, comprising a single corpus header and one or more TEI elements, each containing a single text header and a text. [4. 15.1. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

version

The version of the TEI scheme

Status	Optional
Datatype	`data.version`
Default	5.0
Values	a TEI version number

Member of

Contained by

core: teiCorpus

May contain

core: teiCorpus

header: teiHeader

textstructure: TEI

Declaration

element teiCorpus
{
   att.global.attributes,
   attribute version { data.version }?,
   (
      teiHeader,
      ( ( model.resourceLike+, ( TEI | teiCorpus )* ) | ( TEI | teiCorpus )+ )
   )
}

Example

Note

Must contain one TEI header for the corpus, and a series of TEI elements, one for each text.

This element is mandatory when applicable.

<teiHeader>

<teiHeader> (TEI header) supplies the descriptive and declarative information making up an electronic title page prefixed to every TEI-conformant text. [2.1.1. 15.1. ]

Module header

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

type

specifies the kind of document to which the header is attached, for example whether it is a corpus or individual text.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	text the header is attached to a single text. [Default] corpus the header is attached to a corpus.

Member of

Contained by

core: teiCorpus

textstructure: TEI

May contain

header: encodingDesc fileDesc profileDesc revisionDesc

Declaration

element teiHeader
{
   att.global.attributes,
   attribute type { data.enumerated }?,
   ( fileDesc, model.teiHeaderPart*, revisionDesc? )
}

Example

<teiHeader>
<fileDesc>
  <titleStmt>
   <title>Shakespeare: the first folio (1623) in electronic form</title>
   <author>Shakespeare, William (1564–1616)</author>
   <respStmt>
    <resp>Originally prepared by</resp>
    <name>Trevor Howard-Hill</name>
   </respStmt>
   <respStmt>
    <resp>Revised and edited by</resp>
    <name>Christine Avern-Carr</name>
   </respStmt>
  </titleStmt>
  <publicationStmt>
   <distributor>Oxford Text Archive</distributor>
   <address>
    <addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine>
   </address>
   <idno type="OTA">119</idno>
   <availability>
    <p>Freely available on a non-commercial basis.</p>
   </availability>
   <date when="1968">1968</date>
  </publicationStmt>
  <sourceDesc>
   <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,
       1968)</bibl>
  </sourceDesc>
</fileDesc>
<encodingDesc>
  <projectDesc>
   <p>Originally prepared for use in the production of a series of old-spelling
       concordances in 1968, this text was extensively checked and revised for use during the
       editing of the new Oxford Shakespeare (Wells and Taylor, 1989).</p>
  </projectDesc>
  <editorialDecl>
   <correction>
    <p>Turned letters are silently corrected.</p>
   </correction>
   <normalization>
    <p>Original spelling and typography is retained, except that long s and ligatured
         forms are not encoded.</p>
   </normalization>
  </editorialDecl>
  <refsDecl xml:id="ASLREF">
   <cRefPattern
     matchPattern="(\S+) ([^.]+)\.(.*)"
     replacementPattern="#xpath(//div1[@n='$1']/div2/[@n='$2']//lb[@n='$3'])">
    <p>A reference is created by assembling the following, in the reverse order as that
         listed here: <list>
      <item>the <att>n</att> value of the preceding <gi>lb</gi>
      </item>
      <item>a period</item>
      <item>the <att>n</att> value of the ancestor <gi>div2</gi>
      </item>
      <item>a space</item>
      <item>the <att>n</att> value of the parent <gi>div1</gi>
      </item>
     </list>
    </p>
   </cRefPattern>
  </refsDecl>
</encodingDesc>
<revisionDesc>
  <list>
   <item>
    <date when="1989-04-12">12 Apr 89</date> Last checked by CAC</item>
   <item>
    <date when="1989-03-01">1 Mar 89</date> LB made new file</item>
  </list>
</revisionDesc>
</teiHeader>

Note

One of the few elements unconditionally required in any TEI document.

<term>

<term> contains a single-word, multi-word, or symbolic designation which is regarded as a technical term. [3.3.4. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls) att.pointing (@targetLang, @target, @evaluate) att.typed (@type, @subtype) att.canonical (@key, @ref) att.sortable (@sortKey) att.cReferencing (@cRef)
Member of	model.emphLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi index item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder keywords language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element term { att.global.attributes, att.declaring.attributes, att.pointing.attributes, att.typed.attributes, att.canonical.attributes, att.sortable.attributes, att.cReferencing.attributes, macro.phraseSeq }
Example	A computational device that infers structure from grammatical strings of words is known as a <term>parser</term>, and much of the history of NLP over the last 20 years has been occupied with the design of parsers.
Example	We may define <term xml:id="TDPV" rend="sc">discoursal point of view</term> as <gloss target="#TDPV">the relationship, expressed through discourse structure, between the implied author or some other addresser, and the fiction.</gloss>
Note	This element is used to supply the form under which an index entry is to be made for the location of a parent index element. In formal terminological work, there is frequently discussion over whether terms must be atomic or may include multi-word lexical items, symbolic designations, or phraseological units. The term element may be used to mark any of these. No position is taken on the philosophical issue of what a term can be; the looser definition simply allows the term element to be used by practitioners of any persuasion. As with other members of the att.canonical class, instances of this element occuring in a text may be associated with a canonical definition, either by means of a URI (using the ref attribute), or by means of some system-specific code value (using the key attribute). Because the mutually exclusive target and cRef attributes overlap with the function of the ref attribute, they are deprecated and may be removed at a subsequent release.

<text>

<text> contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. 15.1. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declaring (@decls) att.typed (@type, @subtype)
Member of
Contained by	textstructure: TEI group
May contain	analysis: interp interpGrp core: gap index lb milestone note pb figures: figure linking: anchor textstructure: back body front group
Declaration	element text { att.global.attributes, att.declaring.attributes, att.typed.attributes, ( model.global, ( front, model.global )?, ( body \| group ), model.global, ( back, model.global )? ) }
Example	<text> <front> <docTitle> <titlePart>Autumn Haze</titlePart> </docTitle> </front> <body> <l>Is it a dragonfly or a maple leaf</l> <l>That settles softly down upon the water?</l> </body> </text>
Example	The body of a text may be replaced by a group of nested texts, as in the following schematic: <text> <front> <!-- front matter for the whole group --> </front> <group> <text> <!-- first text --> </text> <text> <!-- second text --> </text> </group> </text>
Note	This element should not be used to represent a text which is inserted at an arbitrary point within the structure of another, for example as in an embedded or quoted narrative; the <floatingText> is provided for this purpose.

<textClass>

<textClass> (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc. [2.4.3. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.declarable (@default)
Member of	model.profileDescPart
Contained by	header: profileDesc
May contain	header: catRef classCode keywords
Declaration	element textClass { att.global.attributes, att.declarable.attributes, ( classCode \| catRef \| keywords )* }
Example	<taxonomy> <category xml:id="acprose"> <catDesc>Academic prose</catDesc> </category> <!-- other categories here --> </taxonomy> <!-- ... --> <textClass> <catRef target="#acprose"/> <classCode scheme="http://www.udcc.org">001.9</classCode> <keywords scheme="http://authorities.loc.gov"> <list> <item>End of the world</item> <item>History - philosophy</item> </list> </keywords> </textClass>

<time>

<time> contains a phrase defining a time of day in any format. [3.5.4. ]
Module	core
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.datable (@calendar, @period) (att.datable.w3c (@when)) att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp)) att.typed (@type, @subtype)
Member of	model.dateLike
Contained by	analysis: s core: abbr add addrLine author bibl biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element time { att.global.attributes, att.datable.attributes, att.editLike.attributes, att.typed.attributes, ( text \| model.gLike \| model.phrase \| model.global )* }
Example	As he sat smiling, the quarter struck — <time when="11:45:00">the quarter to twelve</time>.

<title>

<title> contains a title for any kind of work. [3.11.2.2. 2.2.1. 2.2.5. ]

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.canonical (@key, @ref) att.typed (type, @subtype)

type

classifies the title according to some convenient typology.

Derived from	att.typed
Status	Optional
Datatype	`data.enumerated`
Sample values include:	main main title sub (subordinate) subtitle, title of part alt (alternate) alternate title, often in another language, by which the work is also known short abbreviated form of title desc (descriptive) descriptive paraphrase of the work functioning as a title
Note	This attribute is provided for convenience in analysing titles and processing them according to their type; where such specialized processing is not necessary, there is no need for such analysis, and the entire title, including subtitles and any parallel titles, may be enclosed within a single title element.

level

indicates the bibliographic level for a title, that is, whether it identifies an article, book, journal, series, or unpublished material.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	a (analytic) the title applies to an analytic item, such as an article, poem, or other work published as part of a larger item. m (monographic) the title applies to a monograph such as a book or other item considered to be a distinct publication, including single volumes of multi-volume works j (journal) the title applies to any serial or periodical publication such as a journal, magazine, or newspaper s (series) the title applies to a series of otherwise distinct publications such as a collection u (unpublished) the title applies to any unpublished material (including theses and dissertations unless published by a commercial press)
Note	The level of a title is sometimes implied by its context: for example, a title appearing directly within an <analytic> element is ipso facto of level ‘a’, and one appearing within a <series> element of level ‘s’. For this reason, the level attribute is not required in contexts where its value can be unambiguously inferred. Where it is supplied in such contexts, its value should not contradict the value implied by its parent element.

Member of

model.emphLike

Contained by

analysis: s

figures: cell figDesc

header: authority catDesc change classCode creation distributor edition extent funder language licence principal seriesStmt sponsor titleStmt

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

analysis: interp interpGrp pc s w

core: abbr add address bibl choice cit corr date del desc emph expan foreign gap gloss graphic hi index label lb lg list listBibl mentioned milestone name note num orig pb ptr q ref reg rs sic soCalled stage term time title unclear

figures: figure formula table

header: idno

linking: anchor seg

tagdocs: att code eg gi ident val

Declaration

element title
{
   att.global.attributes,
   att.canonical.attributes,
   att.typed.attribute.subtype,
   attribute type { data.enumerated }?,
   attribute level { "a" | "m" | "j" | "s" | "u" }?,
   macro.paraContent
}

Example

<title>Information Technology and the Research Process: Proceedings of
a conference held at Cranfield Institute of Technology, UK,
18–21 July 1989</title>

Example

<title>Hardy's Tess of the D'Urbervilles: a machine readable
edition</title>

Example

<title type="full">
<title type="main">Synthèse</title>
<title type="sub">an international journal for
epistemology, methodology and history of
science</title>
</title>

Note

The attributes key and ref, inherited from the class att.canonical may be used to indicate the canonical form for the title; the former, by supplying (for example) the identifier of a record in some external library system; the latter by pointing to an XML element somewhere containing the canonical form of the title.

<titlePage>

<titlePage> (title page) contains the title page of a text, appearing within the front or back matter. [4.6. ]

Module textstructure

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

type

classifies the title page according to any convenient typology.

Status	Optional
Datatype	`data.enumerated`
Note	This attribute allows the same element to be used for volume title pages, series title pages, etc., as well as for the‘main’ title page of a work.

Member of

model.frontPart

Contained by

textstructure: back front

May contain

analysis: interp interpGrp

core: gap graphic index lb milestone note pb

figures: figure

linking: anchor

textstructure: argument byline docAuthor docDate docEdition docImprint docTitle epigraph imprimatur titlePart

Declaration

element titlePage
{
   att.global.attributes,
   attribute type { data.enumerated }?,
   (
      model.global*,
      ( model.titlepagePart ),
      ( model.titlepagePart | model.global )*
   )
}

Example

<titlePage>
<docTitle>
  <titlePart type="main">THOMAS OF Reading.</titlePart>
  <titlePart type="alt">OR, The sixe worthy yeomen of the West.</titlePart>
</docTitle>
<docEdition>Now the fourth time corrected and enlarged</docEdition>
<byline>By T.D.</byline>
<figure>
  <head>TP</head>
  <p>Thou shalt labor till thou returne to duste</p>
  <figDesc>Printers Ornament used by TP</figDesc>
</figure>
<docImprint>Printed at <name type="place">London</name> for <name>T.P.</name>
  <date>1612.</date>
</docImprint>
</titlePage>

<titlePart>

<titlePart> contains a subsection or division of the title of a work, as indicated on a title page. [4.6. ]

Module textstructure

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))

type

specifies the role of this subdivision of the title.

Status	Optional
Datatype	`data.enumerated`
Suggested values include:	main main title of the work [Default] sub (subordinate) subtitle of the work alt (alternate) alternative title of the work short abbreviated form of title desc (descriptive) descriptive paraphrase of the work

Member of

model.pLike.front model.titlepagePart

Contained by

textstructure: back docTitle front titlePage

May contain

analysis: interp interpGrp pc s w

figures: figure formula table

header: idno

linking: anchor seg

tagdocs: att code eg gi ident val

Declaration

element titlePart
{
   att.global.attributes,
   attribute type { "main" | "sub" | "alt" | "short" | "desc" | xsd:Name }?,
   macro.paraContent
}

Example

<docTitle>
<titlePart type="main">THE FORTUNES
   AND MISFORTUNES Of the FAMOUS
   Moll Flanders, &c.
</titlePart>
<titlePart type="desc">Who was BORN in NEWGATE,
   And during a Life of continu'd Variety for
   Threescore Years, besides her Childhood, was
   Twelve Year a <hi>Whore</hi>, five times a <hi>Wife</hi> (wherof
   once to her own Brother) Twelve Year a <hi>Thief,</hi>
   Eight Year a Transported <hi>Felon</hi> in <hi>Virginia</hi>,
   at last grew <hi>Rich</hi>, liv'd <hi>Honest</hi>, and died a
<hi>Penitent</hi>.</titlePart>
</docTitle>

<titleStmt>

<titleStmt> (title statement) groups information about the title of a work and those responsible for its content. [2.2.1. 2.2. ]
Module	header
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of
Contained by	header: fileDesc
May contain	core: author editor respStmt title header: funder principal sponsor
Declaration	element titleStmt { att.global.attributes, ( title+, model.respLike* ) }
Example	<titleStmt> <title>Capgrave's Life of St. John Norbert: a machine-readable transcription</title> <respStmt> <resp>compiled by</resp> <name>P.J. Lucas</name> </respStmt> </titleStmt>

<trailer>

<trailer> contains a closing title or footer appearing at the end of a division of a text. [4.2.4. 4.2. ]
Module	textstructure
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)
Member of	model.divBottomPart
Contained by	core: lg list figures: figure table textstructure: back body div front group postscript
May contain	analysis: interp interpGrp pc s w core: abbr add address choice corr date del emph expan foreign gap gloss graphic hi index lb mentioned milestone name note num orig pb ptr ref reg rs sic soCalled term time title unclear figures: figure formula header: idno linking: anchor seg tagdocs: att code gi ident val
Declaration	element trailer { att.global.attributes, att.typed.attributes, macro.phraseSeq }
Example	<trailer>Explicit pars tertia</trailer>

<unclear>

<unclear> contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source. [11.3.3.1. 3.4.3. ]

Module core

Attributes

reason

indicates why the material is hard to transcribe.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	one or more words describing the difficulty, e.g. faded, background-noise, passing-truck, illegible, eccentric-ductus.
<div> <head>Rx</head> <p>500 mg <unclear reason="illegible">placebo</unclear> </p> </div>

hand

Where the difficulty in transcription arises from action (partial deletion, etc.) assignable to an identifiable hand, signifies the hand responsible for the action.

Status	Optional
Datatype	`data.pointer`
Values	must point to a hand identifier typically but not necessarily declared in the document header (see section 11.3.2.1. ).

agent

Where the difficulty in transcription arises from damage, categorizes the cause of the damage, if it can be identified.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	rubbing damage results from rubbing of the leaf edges mildew damage results from mildew on the leaf surface smoke damage results from smoke

Member of

model.choicePart model.pPart.transcriptional

Contained by

analysis: pc s w

figures: cell

header: change distributor edition extent licence

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

analysis: interp interpGrp pc s w

figures: figure formula table

header: idno

linking: anchor seg

tagdocs: att code eg gi ident val

Declaration

element unclear
{
   att.global.attributes,
   att.editLike.attributes,
   attribute reason { list { data.word+ } }?,
   attribute hand { data.pointer }?,
   attribute agent { data.enumerated }?,
   macro.paraContent
}

Example

<u> ...and then <unclear reason="background-noise">Nathalie</unclear> said ... </u>

Note

The same element is used for all cases of uncertainty in the transcription of element content, whether for written or spoken material. For other aspects of certainty, uncertainty, and reliability of tagging and transcription, see chapter 21. .

The <damage>, gap, del, unclear and <supplied> elements may be closely allied in use. See section 11.3.3.2. for discussion of which element is appropriate for which circumstance.

<val>

<val> (value) contains a single attribute value. [22. 22.4.5. ]
Module	tagdocs
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend) (att.global.linking (@corresp, @next, @prev)) (att.global.analytic (@ana)) (att.global.facs (@facs))
Member of	model.phrase.xml
Contained by	analysis: s core: abbr add addrLine author biblScope corr date del desc editor emph expan foreign gloss head hi item l label mentioned name note num orig p pubPlace publisher q ref reg resp rs sic soCalled speaker stage term time title unclear figures: cell figDesc header: authority catDesc change classCode creation distributor edition extent funder language licence principal sponsor linking: seg tagdocs: eg textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer
May contain	Character data only
Declaration	element val { att.global.attributes, text }
Example	<val>unknown</val>

<w>

<w> (word) represents a grammatical (not necessarily orthographic) word. [17.1. ]

Module analysis

Attributes

lemma

provides a lemma for the word, such as an uninflected dictionary entry form.

Status	Optional
Datatype	`data.text`

lemmaRef

provides a pointer to a definition of the lemma for the word, for example in an online lexicon.

Status	Optional
Datatype	`data.pointer`

Member of

model.segLike

Contained by

analysis: s w

figures: cell

header: change distributor edition extent licence

linking: seg

tagdocs: eg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint imprimatur opener salute signed titlePart trailer

May contain

analysis: interp interpGrp pc w

core: abbr add choice corr del expan gap hi index lb milestone note orig pb reg sic unclear

figures: figure

linking: anchor seg

Declaration

element w
{
   att.global.attributes,
   att.segLike.attributes,
   att.typed.attributes,
   attribute lemma { data.text }?,
   attribute lemmaRef { data.pointer }?,
   (
      text
    | model.gLike
    | seg
    | w
    | m
    | c
    | pc
    | model.global
    | model.lPart
    | model.hiLike
    | model.pPart.edit
   )*
}

Example

Schema tei_lite: Model classes

model.addressLike

model.addressLike groups elements used to represent a postal or e-mail address. [1. ]
Module	tei
Used by	model.pPart.data
Members	address

model.addrPart

model.addrPart groups elements such as names or postal codes which may appear as part of a postal address. [3.5.2. ]
Module	tei
Used by	address
Members	model.nameLike [model.nameLike.agent [name] idno rs] addrLine

model.availabilityPart

model.availabilityPart groups elements such as licences and paragraphs of text which may appear as part of an availability statement [2.2.4. ]
Module	tei
Used by	availability
Members	licence

model.biblLike

model.biblLike groups elements containing a bibliographic description. [3.11. ]
Module	tei
Used by	cit listBibl model.inter relatedItem sourceDesc taxonomy
Members	bibl listBibl

model.biblPart

model.biblPart groups elements which represent components of a bibliographic description. [3.11. ]
Module	tei
Used by	bibl
Members	model.imprintPart [biblScope distributor pubPlace publisher] model.respLike [author editor funder principal respStmt sponsor] bibl edition extent relatedItem

model.choicePart

model.choicePart groups elements (other than choice itself) which can be used within a choice alternation. [3.4. ]
Module	tei
Used by	choice
Members	abbr corr expan orig reg seg sic unclear

model.common

model.common groups common chunk- and inter-level elements. [1.3. ]
Module	tei
Used by	argument body div epigraph figure postscript
Members	model.divPart [model.lLike [l] model.pLike [p] lg sp] model.inter [model.biblLike [bibl listBibl] model.egLike [eg] model.labelLike [desc label] model.listLike [list table] model.qLike [model.quoteLike [cit] q] model.stageLike [stage] ]
Note	This class defines the set of chunk- and inter-level elements; it is used in many content models, including those for textual divisions.

model.dateLike

model.dateLike groups elements containing temporal expressions. [3.5.4. 13.3.6. ]
Module	tei
Used by	model.pPart.data
Members	date time

model.descLike

model.descLike groups elements which contain a description of their function.
Module	tei
Used by	category gap graphic interp interpGrp taxonomy
Members	desc

model.divBottom

model.divBottom groups elements appearing at the end of a text division. [4.2. ]
Module	tei
Used by	body div figure front group lg list table
Members	model.divBottomPart [closer postscript signed trailer] model.divWrapper [argument byline dateline docAuthor docDate epigraph salute]

model.divBottomPart

model.divBottomPart groups elements which can occur only at the end of a text division. [4.6. ]
Module	tei
Used by	back model.divBottom postscript
Members	closer postscript signed trailer

model.divGenLike

model.divGenLike groups elements used to represent a structural division which is generated rather than explicitly present in the source.
Module	tei
Used by	body div
Members	divGen

model.divLike

model.divLike groups elements used to represent un-numbered generic structural divisions.
Module	tei
Used by	back body div front
Members	div

model.divPart

model.divPart groups paragraph-level elements appearing directly within divisions. [1.3. ]
Module	tei
Used by	macro.specialPara model.common
Members	model.lLike [l] model.pLike [p] lg sp
Note	Note that this element class does not include members of the model.inter class, which can appear either within or between paragraph-level items.

model.divTop

model.divTop groups elements appearing at the beginning of a text division. [4.2. ]
Module	tei
Used by	body div group lg list
Members	model.divTopPart [model.headLike [head] opener] model.divWrapper [argument byline dateline docAuthor docDate epigraph salute]

model.divTopPart

model.divTopPart groups elements which can occur only at the beginning of a text division. [4.6. ]
Module	tei
Used by	model.divTop postscript
Members	model.headLike [head] opener

model.divWrapper

model.divWrapper groups elements which can appear at either top or bottom of a textual division. [4.2. ]
Module	tei
Used by	model.divBottom model.divTop
Members	argument byline dateline docAuthor docDate epigraph salute

model.egLike

model.egLike groups elements containing examples or illustrations. [22.1.1. ]
Module	tei
Used by	cit model.inter
Members	eg

model.emphLike

model.emphLike groups phrase-level elements which are typographically distinct and to which a specific function can be attributed. [3.3. ]
Module	tei
Used by	model.highlighted model.limitedPhrase
Members	code emph foreign gloss ident mentioned soCalled term title

model.encodingDescPart

model.encodingDescPart groups elements which may be used inside encodingDesc and appear multiple times.
Module	tei
Used by	encodingDesc
Members	classDecl editorialDecl projectDesc refsDecl samplingDecl

model.entryPart.top

model.entryPart.top groups high level elements within a structured dictionary entry [9.2. ]
Module	tei
Used by
Members	cit
Note	Members of this class typically contain related parts of a dictionary entry which form a coherent subdivision, for example a particular sense, homonym, etc.

model.frontPart

model.frontPart groups elements which appear at the level of divisions within front or back matter. [7.1. ]
Module	tei
Used by	back front
Members	divGen titlePage

model.global

model.global groups elements which may appear at any point within a TEI text. [1.3. ]
Module	tei
Used by	address argument back bibl body byline cit closer date div docImprint docTitle epigraph figure front group head lg list macro.paraContent macro.phraseSeq macro.phraseSeq.limited macro.specialPara opener postscript sp table text time titlePage w
Members	model.global.edit [gap] model.global.meta [index interp interpGrp] model.milestoneLike [anchor lb milestone pb] model.noteLike [note] figure

model.global.edit

model.global.edit groups globally available elements which perform a specifically editorial function. [1.3. ]
Module	tei
Used by	model.global
Members	gap

model.global.meta

model.global.meta groups globally available elements which describe the status of other elements. [1.3. ]
Module	tei
Used by	model.global
Members	index interp interpGrp
Note	Elements in this class are typically used to hold groups of links or of abstract interpretations, or by provide indications of certainty etc. It may find be convenient to localize all metadata elements, for example to contain them within the same divison as the elements that they relate to; or to locate them all to a division of their own. They may however appear at any point in a TEI text.

model.glossLike

model.glossLike groups elements which provide an alternative name, explanation, or description for any markup construct.
Module	tei
Used by	category taxonomy
Members	gloss

model.graphicLike

model.graphicLike groups elements containing images, formulae, and similar objects. [3.9. ]
Module	tei
Used by	figure formula model.phrase table
Members	formula graphic

model.headLike

model.headLike groups elements used to provide a title or heading at the start of a text division.
Module	tei
Used by	argument divGen figure listBibl model.divTopPart table
Members	head

model.highlighted

model.highlighted groups phrase-level elements which are typographically distinct. [3.3. ]
Module	tei
Used by	bibl model.phrase
Members	model.emphLike [code emph foreign gloss ident mentioned soCalled term title] model.hiLike [hi]

model.hiLike

model.hiLike groups phrase-level elements which are typographically distinct but to which no specific function can be attributed. [3.3. ]
Module	tei
Used by	formula model.highlighted model.limitedPhrase model.linePart w
Members	hi

model.imprintPart

model.imprintPart groups the bibliographic elements which occur inside imprints. [3.11. ]
Module	tei
Used by	model.biblPart
Members	biblScope distributor pubPlace publisher

model.inter

model.inter groups elements which can appear either within or between paragraph-like elements. [1.3. ]
Module	tei
Used by	head macro.limitedContent macro.paraContent macro.specialPara model.common
Members	model.biblLike [bibl listBibl] model.egLike [eg] model.labelLike [desc label] model.listLike [list table] model.qLike [model.quoteLike [cit] q] model.stageLike [stage]

model.labelLike

model.labelLike groups elements used to gloss or explain other parts of a document.
Module	tei
Used by	lg model.inter
Members	desc label

model.limitedPhrase

model.limitedPhrase groups phrase-level elements excluding those elements primarily intended for transcription of existing sources. [1.3. ]
Module	tei
Used by	catDesc creation macro.limitedContent macro.phraseSeq.limited
Members	model.emphLike [code emph foreign gloss ident mentioned soCalled term title] model.hiLike [hi] model.pPart.data [model.addressLike [address] model.dateLike [date time] model.measureLike [num] model.nameLike [model.nameLike.agent [name] idno rs] ] model.pPart.editorial [abbr choice expan] model.phrase.xml [att gi val] model.ptrLike [ptr ref]

model.linePart

model.linePart groups transcriptional elements which appear within lines or zones of a source-oriented transcription within a <sourceDoc> element.
Module	tei
Used by
Members	model.hiLike [hi] model.pPart.transcriptional [add corr del orig reg sic unclear] model.segLike [pc s seg w] choice

model.listLike

model.listLike groups list-like elements. [3.7. ]
Module	tei
Used by	back model.inter sourceDesc sp
Members	list table

model.lLike

model.lLike groups elements representing metrical components such as verse lines.
Module	tei
Used by	head lg model.divPart sp
Members	l

model.measureLike

model.measureLike groups elements which denote a number, a quantity, a measurement, or similar piece of text that conveys some numerical meaning. [3.5.3. ]
Module	tei
Used by	model.pPart.data
Members	num

model.milestoneLike

model.milestoneLike groups milestone-style elements used to represent reference systems. [1.3. 3.10.3. ]
Module	tei
Used by	listBibl model.global
Members	anchor lb milestone pb

model.nameLike

model.nameLike groups elements which name or refer to a person, place, or organization.
Module	tei
Used by	model.addrPart model.pPart.data
Members	model.nameLike.agent [name] idno rs
Note	A superset of the naming elements that may appear in datelines, addresses, statements of responsibility, etc.

model.nameLike.agent

model.nameLike.agent groups elements which contain names of individuals or corporate bodies. [3.5. ]
Module	tei
Used by	model.nameLike respStmt
Members	name
Note	This class is used in the content model of elements which reference names of people or organizations.

model.noteLike

model.noteLike groups globally-available note-like elements. [3.8. ]
Module	tei
Used by	model.global notesStmt
Members	note

model.phrase

model.phrase groups elements which can occur at the level of individual words or phrases. [1.3. ]
Module	tei
Used by	byline closer date docImprint head macro.paraContent macro.phraseSeq macro.specialPara opener time
Members	model.graphicLike [formula graphic] model.highlighted [model.emphLike [code emph foreign gloss ident mentioned soCalled term title] model.hiLike [hi] ] model.lPart model.pPart.data [model.addressLike [address] model.dateLike [date time] model.measureLike [num] model.nameLike [model.nameLike.agent [name] idno rs] ] model.pPart.edit [model.pPart.editorial [abbr choice expan] model.pPart.transcriptional [add corr del orig reg sic unclear] ] model.phrase.xml [att gi val] model.ptrLike [ptr ref] model.segLike [pc s seg w]
Note	This class of elements can occur only within larger elements of the class inter or chunk. In prose, this means these elements can occur within paragraphs, list items, lines of verse, etc.

model.phrase.xml

model.phrase.xml groups phrase-level elements used to encode XML constructs such as element names, attribute names, and attribute values [22. ]
Module	tei
Used by	model.limitedPhrase model.phrase
Members	att gi val

model.pLike

model.pLike groups paragraph-like elements.
Module	tei
Used by	availability back editionStmt editorialDecl encodingDesc front model.divPart projectDesc publicationStmt refsDecl samplingDecl seriesStmt sourceDesc sp
Members	p

model.pLike.front

model.pLike.front groups paragraph-like elements which can occur as direct constituents of front matter. [4.6. ]
Module	tei
Used by	back front
Members	argument byline docAuthor docDate docEdition docImprint docTitle epigraph head titlePart

model.pPart.data

model.pPart.data groups phrase-level elements containing names, dates, numbers, measures, and similar data. [3.5. ]
Module	tei
Used by	bibl model.limitedPhrase model.phrase
Members	model.addressLike [address] model.dateLike [date time] model.measureLike [num] model.nameLike [model.nameLike.agent [name] idno rs]

model.pPart.edit

model.pPart.edit groups phrase-level elements for simple editorial correction and transcription. [3.4. ]
Module	tei
Used by	bibl model.phrase pc w
Members	model.pPart.editorial [abbr choice expan] model.pPart.transcriptional [add corr del orig reg sic unclear]

model.pPart.editorial

model.pPart.editorial groups phrase-level elements for simple editorial interventions that may be useful both in transcribing and in authoring. [3.4. ]
Module	tei
Used by	model.limitedPhrase model.pPart.edit
Members	abbr choice expan

model.pPart.transcriptional

model.pPart.transcriptional groups phrase-level elements used for editorial transcription of pre-existing source materials. [3.4. ]
Module	tei
Used by	model.linePart model.pPart.edit
Members	add corr del orig reg sic unclear

model.profileDescPart

model.profileDescPart groups elements which may be used inside profileDesc and appear multiple times.
Module	tei
Used by	profileDesc
Members	creation langUsage textClass

model.ptrLike

model.ptrLike groups elements used for purposes of location and reference. [3.6. ]
Module	tei
Used by	bibl cit model.limitedPhrase model.phrase relatedItem
Members	ptr ref

model.publicationStmtPart

model.publicationStmtPart groups elements which may appear within the publicationStmt element of the TEI header. [2.2.4. ]
Module	tei
Used by	publicationStmt
Members	address authority availability date distributor idno pubPlace publisher

model.qLike

model.qLike groups elements related to highlighting which can appear either within or between chunk-level elements. [3.3. ]
Module	tei
Used by	cit model.inter sp
Members	model.quoteLike [cit] q

model.quoteLike

model.quoteLike groups elements used to directly contain quotations.
Module	tei
Used by	model.qLike
Members	cit

model.respLike

model.respLike groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element.
Module	tei
Used by	editionStmt model.biblPart titleStmt
Members	author editor funder principal respStmt sponsor

model.segLike

model.segLike groups elements used for arbitrary segmentation. [16.3. 17.1. ]
Module	tei
Used by	bibl model.linePart model.phrase
Members	pc s seg w
Note	The principles on which segmentation is carried out, and any special codes or attribute values used, should be defined explicitly in the <segmentation> element of the encodingDesc within the associated TEI header.

model.stageLike

model.stageLike groups elements containing stage directions or similar things defined by the module for performance texts. [7.3. ]
Module	tei
Used by	lg model.inter sp
Members	stage
Note	Stage directions are members of class inter: that is, they can appear between or within component-level elements.

model.teiHeaderPart

model.teiHeaderPart groups high level elements which may appear more than once in a TEI header.
Module	tei
Used by	teiHeader
Members	encodingDesc profileDesc

model.titlepagePart

model.titlepagePart groups elements which can occur as direct constituents of a title page, such as docTitle, docAuthor, docImprint, or epigraph. [4.6. ]
Module	tei
Used by	titlePage
Members	argument byline docAuthor docDate docEdition docImprint docTitle epigraph graphic imprimatur titlePart

Schema tei_lite: Attribute classes

att.ascribed

att.ascribed provides attributes for elements representing speech or action that can be ascribed to a specific individual. [1. ]

Module tei

Members change q sp stage

Attributes

who

indicates the person, or group of people, to whom the element content is ascribed.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	For transcribed speech, this will typically identify a participant or participant group; in other contexts, it will point to any identified <person> element.
In the following example from Hamlet, speeches (sp) in the body of the play are linked to <castItem> elements in the <castList> using the who attribute. <castItem type="role"> <role xml:id="Barnardo">Bernardo</role> </castItem> <castItem type="role"> <role xml:id="Francisco">Francisco</role> <roleDesc>a soldier</roleDesc> </castItem> <!-- ... --> <sp who="#Barnardo"> <speaker>Bernardo</speaker> <l n="1">Who's there?</l> </sp> <sp who="#Francisco"> <speaker>Francisco</speaker> <l n="2">Nay, answer me: stand, and unfold yourself.</l> </sp>

att.breaking

att.breaking provides an attribute to indicate whether or not the element concerned is considered to mark the end of an orthographic token in the same way as whitespace. [3.10.3. ]

Module tei

Members lb milestone pb

Attributes

break

indicates whether or not the element bearing this attribute should be considered to mark the end of an orthographic token in the same way as whitespace.

Status	Recommended
Datatype	`data.enumerated`
Sample values include	yes the element bearing this attribute is considered to mark the end of any adjacent orthographic token irrespective of the presence of any adjacent whitespace no the element bearing this attribute is considered not to mark the end of any adjacent orthographic token irrespective of the presence of any adjacent whitespace maybe l'encodage ne prends aucune position sur la question.
In the following lines from the Dream of the Rood, linebreaks occur in the middle of the words lāðost and reord-berendum. <ab> ...eƿesa tome iu icƿæs ȝeƿorden ƿita heardoſt . leodum la<lb break="no"/> ðost ærþan ichim lifes ƿeȝ rihtne ȝerymde reord be<lb break="no"/> rendum hƿæt me þaȝeƿeorðode ƿuldres ealdor ofer... </ab>

att.canonical

att.canonical provides attributes which can be used to associate a representation such as a name or title with canonical information about the object being named or referenced.

Module tei

Members att.naming [att.personal [name] author editor pubPlace rs] docAuthor docTitle resp term title

Attributes

key

provides an externally-defined means of identifying the entity (or entities) being named, using a coded value of some kind.

Status	Optional
Datatype	`data.text`
<author> <name key="name 427308" type="organisation">[New Zealand Parliament, Legislative Council]</name> </author>
<author> <name key="Hugo, Victor (1802-1885)" ref="http://www.idref.fr/026927608">Victor Hugo</name> </author>
Note	The value may be a unique identifier from a database, or any other externally-defined string identifying the referent. No particular syntax is proposed for the values of the key attribute, since its form will depend entirely on practice within a given project. For the same reason, this attribute is not recommended in data interchange, since there is no way of ensuring that the values used by one project are distinct from those used by another. In such a situation, a preferable approach for magic tokens which follows standard practice on the Web is to use a ref attribute whose value is a tag URI as defined in RFC 4151.

ref

(reference) provides an explicit means of locating a full definition for the entity being named by means of one or more URIs.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Note	The value must point directly to one or more XML elements by means of one or more URIs, separated by whitespace. If more than one is supplied, the implication is that the name identifies several distinct entities.

att.cReferencing

att.cReferencing provides an attribute which may be used to supply a canonical reference as a means of identifying the target of a pointer.

Module tei

Members gloss ptr ref term

Attributes

cRef

(canonical reference) specifies the destination of the pointer by supplying a canonical reference from a scheme defined in a refsDecl element in the TEI header

Status	Optional
Datatype	`data.text`
Values	the value of cRef should be constructed so that when the algorithm for the resolution of canonical references (described in section 16.2.6. ) is applied to it the result is a valid URI reference to the intended target
Note	The refsDecl to use may be indicated with the decls attribute. Currently these Guidelines only provide for a single canonical reference to be encoded on any given ptr element.

att.datable

att.datable provides attributes for normalization of elements that contain dates, times, or datable events.

Module tei

Members change creation date licence name time

Attributes

Attributes att.datable.w3c (@when)

calendar

indicates the system or calendar to which the date represented by the content of this element belongs.

Status	Optional
Datatype	`data.pointer`
Schematron	<sch:rule context="tei:*[@calendar]"> <sch:assert test="string-length(.) gt 0">@calendar indicates the system or calendar to which the date represented by the content of this element belongs, but this <sch:name/> element has no textual content.</sch:assert></sch:rule>
He was born on <date calendar="#Gregorian">Feb. 22, 1732</date> (<date calendar="#Julian" when="1732-02-22"> Feb. 11, 1731/32, O.S.</date>).

period

supplies a pointer to some location defining a named period of time within which the datable item is understood to have occurred.

Status	Optional
Datatype	`data.pointer`

Note

This ‘superclass’ provides attributes that can be used to provide normalized values of temporal information. By default, the attributes from the att.datable.w3c class are provided. If the module for names & dates is loaded, this class also provides attributes from the att.datable.iso and att.datable.custom classes. In general, the possible values of attributes restricted to the W3C datatypes form a subset of those values available via the ISO 8601 standard. However, the greater expressiveness of the ISO datatypes may not be needed, and there exists much greater software support for the W3C datatypes.

att.datable.w3c

att.datable.w3c provides attributes for normalization of elements that contain datable events using the W3C datatypes.

Module tei

Members att.datable [change creation date licence name time]

Attributes

when

supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd.

Status	Optional
Datatype	`data.temporal.w3c`
Values	A normalized form of temporal expression conforming to the W3C XML Schema Part 2: Datatypes Second Edition.
Examples of W3C date, time, and date & time formats. <p> <date when="1945-10-24">24 Oct 45</date> <date when="1996-09-24T07:25:00Z">September 24th, 1996 at 3:25 in the morning</date> <time when="1999-01-04T20:42:00-05:00">Jan 4 1999 at 8 pm</time> <time when="14:12:38">fourteen twelve and 38 seconds</time> <date when="1962-10">October of 1962</date> <date when="--06-12">June 12th</date> <date when="---01">the first of the month</date> <date when="--08">August</date> <date when="2006">MMVI</date> <date when="0056">AD 56</date> <date when="-0056">56 BC</date> </p>
This list begins in the year 1632, more precisely on Trinity Sunday, i.e. the Sunday after Pentecost, in that year the <date calendar="#Julian" when="1632-06-06">27th of May (old style)</date>.
<opener> <dateline> <placeName>Dorchester, Village,</placeName> <date when="1828-03-02">March 2d. 1828.</date> </dateline> <salute>To Mrs. Cornell,</salute> Sunday <time when="12:00:00">noon.</time> </opener>
Note	The value of the when attribute should be the normalized representation of the date, time, or combined date & time intended, in any of the standard formats specified by XML Schema Part 2: Datatypes Second Edition, using the Gregorian calendar. The most commonly-encountered format for the date part of the when attribute is `yyyy-mm-dd`, but `yyyy`, `--mm`, `---dd`, `yyyy-mm`, or `--mm-dd` may also be used. For the time part, the form `hh:mm:ss` is used. Note that this format does not currently permit use of the value 0000 to represent the year 1 BCE; instead the value -0001 should be used.

att.declarable

att.declarable provides attributes for those elements in the TEI header which may be independently selected by means of the special purpose decls attribute. [15.3. ]

Module tei

Members availability bibl editorialDecl langUsage listBibl projectDesc refsDecl samplingDecl sourceDesc textClass

Attributes

default

indicates whether or not this element is selected by default when its parent is selected.

Status	Optional
Datatype	`data.truthValue`
Legal values are:	true This element is selected if its parent is selected false This element can only be selected explicitly, unless it is the only one of its kind, in which case it is selected if its parent is selected. [Default]

Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. . Only one element of a particular type may have a default attribute with a value of true.

att.declaring

att.declaring provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element. [15.3. ]

Module tei

Members back body div front gloss graphic group lg p ptr ref term text

Attributes

decls

identifies one or more declarable elements within the header, which are understood to apply to the element bearing this attribute and its content.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	must identify a set of declarable elements of different types.

Note

The rules governing the association of declarable elements with individual parts of a TEI text are fully defined in chapter 15.3. .

att.dimensions

att.dimensions provides attributes for describing the size of physical objects.

Module tei

Members att.editLike [att.transcriptional [add del] corr date expan gap name reg time unclear]

Attributes

Attributes att.ranging (@atLeast, @atMost, @min, @max, @confidence)

unit

names the unit used for the measurement

Status	Optional
Datatype	`data.enumerated`
Suggested values include:	cm (centimetres) mm (millimetres) in (inches) lines lines of text chars (characters) characters of text

quantity

specifies the length in the units specified

Status	Optional
Datatype	`data.numeric`

extent

indicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words.

Status	Optional
Datatype	`data.text`
Values	any measurement phrase, e.g. 25 letters, 2 × 3 inches.
<gap extent="5 words"/>
<height extent="half the page"/>

precision

characterizes the precision of the values specified by the other attributes.

Status	Optional
Datatype	`data.certainty`

scope

where the measurement summarizes more than one observation, specifies the applicability of this measurement.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	all measurement applies to all instances. most measurement applies to most of the instances inspected. range measurement applies to only the specified range of instances.

att.divLike

att.divLike provides attributes common to all elements which behave in the same way as divisions. [4. ]

Module tei

Members div lg

Attributes

Attributes att.fragmentable (@part)

org

(organization) specifies how the content of the division is organized.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	composite no claim is made about the sequence in which the immediate contents of this division are to be processed, or their inter-relationships. uniform the immediate contents of this element are regarded as forming a logical unit, to be processed in sequence. [Default]

sample

indicates whether this division is a sample of the original source and if so, from which part.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	initial division lacks material present at end in source. medial division lacks material at start and end. final division lacks material at start. unknown position of sampled material within original unknown. complete division is not a sample. [Default]

att.edition

att.edition provides attributes identifying the source edition from which some encoded feature derives.

Module tei

Members lb milestone pb

Attributes

(edition) supplies an arbitrary identifier for the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	A string of characters or sigil used conventionally to identify the edition.

edRef

(edition reference) provides a pointer to the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace

Example

<l>Of Mans First Disobedience,<lb ed="1674"/> and<lb ed="1667"/> the Fruit</l>
<l>Of that Forbidden Tree, whose<lb ed="1667 1674"/> mortal tast</l>
<l>Brought Death into the World,<lb ed="1667"/> and all<lb ed="1674"/> our woe,</l>

Example

<listBibl>
<bibl xml:id="stapledon1937">
<author>Olaf Stapledon</author>,
<title>Starmaker</title>, <publisher>Methuen</publisher>, <date>1937</date>
</bibl>
<bibl xml:id="stapledon1968">
<author>Olaf Stapledon</author>,
<title>Starmaker</title>, <publisher>Dover</publisher>, <date>1968</date>
</bibl>
</listBibl>

<p>Looking into the future aeons from the supreme moment of
the cosmos, I saw the populations still with all their
strength maintaining the<pb n="411" edRef="#stapledon1968"/>essentials of their ancient culture,
still living their personal lives in zest and endless
novelty of action, … I saw myself still
preserving, though with increasing difficulty, my lucid
con-<pb n="291" edRef="#stapledon1937"/>sciousness;</p>

att.editLike

att.editLike provides attributes describing the nature of an encoded scholarly intervention or interpretation of any kind. [1. ]

Module tei

Members att.transcriptional [add del] corr date expan gap name reg time unclear

Attributes

Attributes att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) att.responsibility (@cert, @resp)

evidence

indicates the nature of the evidence supporting the reliability or accuracy of the intervention or interpretation.

Status	Optional
Datatype	1–∞ occurrences of `data.enumerated`separated by whitespace
Suggested values include:	internal there is internal evidence to support the intervention. external there is external evidence to support the intervention. conjecture the intervention or interpretation has been made by the editor, cataloguer, or scholar on the basis of their expertise.

source

contains a list of one or more pointers indicating sources supporting the given intervention or interpretation.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	A space-delimited series of sigla; each sigil should correspond to a witness or witness group and occur as the value of the xml:id attribute on a <witness> or <msDesc> element elsewhere in the document.

instant

indicates whether this is an instant revision or not.

Status	Optional
Datatype	`data.xTruthValue`
Default	false

Note

The members of this attribute class are typically used to represent any kind of editorial intervention in a text, for example a correction or interpretation, or to date or localize manuscripts etc.

att.fragmentable

att.fragmentable groups structural elements which may be fragmented, usually as a consequence of some overlapping hierarchy.

Module tei

Members att.divLike [div lg] att.segLike [pc s seg w] l p

Attributes

part

specifies whether or not its parent element is fragmented in some way, typically by some other overlapping structure: for example a speech which is divided between two or more verse stanzas, a paragraph which is split across a page division, a verse line which is divided between two speakers.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	Y (yes) the element is fragmented in some (unspecified) respect N (no) either the element is not fragmented, or no claim is made as to its completeness. [Default] I (initial) this is the initial part of a fragmented element M (medial) this is a medial part of a fragmented element F (final) this is the final part of a fragmented element
Note	The values I, M, or F should be used only where it is clear how the element may be be reconstituted.

att.global

att.global provides attributes common to all elements in the TEI encoding scheme.

Module tei

Members TEI abbr add addrLine address anchor argument att author authority availability back bibl biblScope body byline catDesc catRef category cell change choice cit classCode classDecl closer code corr creation date dateline del desc distributor div divGen docAuthor docDate docEdition docImprint docTitle edition editionStmt editor editorialDecl eg emph encodingDesc epigraph expan extent figDesc figure fileDesc foreign formula front funder gap gi gloss graphic group head hi ident idno imprimatur index interp interpGrp item keywords l label langUsage language lb lg licence list listBibl mentioned milestone name note notesStmt num opener orig p pb pc postscript principal profileDesc projectDesc ptr pubPlace publicationStmt publisher q ref refsDecl reg relatedItem resp respStmt revisionDesc row rs s salute samplingDecl seg seriesStmt sic signed soCalled sourceDesc sp speaker sponsor stage table taxonomy teiCorpus teiHeader term text textClass time title titlePage titlePart titleStmt trailer unclear val w

Attributes

Attributes att.global.linking (@corresp, @next, @prev) att.global.analytic (@ana) att.global.facs (@facs)

xml:id

(identifier) provides a unique identifier for the element bearing the attribute.

Status	Optional
Datatype	`xsd:ID`
Note	The xml:id attribute may be used to specify a canonical reference for an element; see section 3.10. .

(number) gives a number (or other label) for an element, which is not necessarily unique within the document.

Status	Optional
Datatype	`data.text`
Values	the value consists of a single token which may however contain punctuation characters, whitespace or word separating characters. It need not be restricted to numbers.
Note	The n attribute may be used to specify the numbering of chapters, sections, list items, etc.; it may also be used in the specification of a standard reference system for the text.

xml:lang

(language) indicates the language of the element content using a ‘tag’ generated according to BCP 47.

Status	Optional
Datatype	`data.language`
Values	The value must conform to BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a language element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their (IETF)Internet Engineering Task Force definitions.
<p> … The consequences of this rapid depopulation were the loss of the last <foreign xml:lang="rap">ariki</foreign> or chief (Routledge 1920:205,210) and their connections to ancestral territorial organization.</p>
Note	the xml:lang value will be inherited from the immediately enclosing element, or from its parent, and so on up the document hierarchy. It is generally good practice to specify xml:lang at the highest appropriate level, noticing that a different default may be needed for the teiHeader from that needed for the associated resource element or elements, and that a single TEI document may contain texts in many languages. The authoritative list of registered language subtags is maintained by IANA and is available at http://www.iana.org/assignments/language-subtag-registry. For a good general overview of the construction of language tags, see http://www.w3.org/International/articles/language-tags/, and for a practical step-by-step guide, see http://www.w3.org/International/questions/qa-choosing-language-tags.

rend

(rendition) indicates how the element in question was rendered or presented in the source text.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	may contain any number of tokens, each of which may contain letters, punctuation marks, or symbols, but not whitespace or word-separating characters.
<head rend="align(center) case(allcaps)"> <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle, <lb/>On Her <lb/> <hi rend="case(mixed)">New Blazing-World</hi>. </head>
Note	These Guidelines make no binding recommendations for the values of the rend attribute; the characteristics of visual presentation vary too much from text to text and the decision to record or ignore individual characteristics varies too much from project to project. Some potentially useful conventions are noted from time to time at appropriate points in the Guidelines. The values of the rend attribute are a set of sequence-indeterminate individual tokens separated by whitespace.

att.global.analytic

att.global.analytic provides additional global attributes for associating specific analyses or interpretations with appropriate portions of a text. [17.3. ]

Module analysis

Members att.global [TEI abbr add addrLine address anchor argument att author authority availability back bibl biblScope body byline catDesc catRef category cell change choice cit classCode classDecl closer code corr creation date dateline del desc distributor div divGen docAuthor docDate docEdition docImprint docTitle edition editionStmt editor editorialDecl eg emph encodingDesc epigraph expan extent figDesc figure fileDesc foreign formula front funder gap gi gloss graphic group head hi ident idno imprimatur index interp interpGrp item keywords l label langUsage language lb lg licence list listBibl mentioned milestone name note notesStmt num opener orig p pb pc postscript principal profileDesc projectDesc ptr pubPlace publicationStmt publisher q ref refsDecl reg relatedItem resp respStmt revisionDesc row rs s salute samplingDecl seg seriesStmt sic signed soCalled sourceDesc sp speaker sponsor stage table taxonomy teiCorpus teiHeader term text textClass time title titlePage titlePart titleStmt trailer unclear val w]

Attributes

ana

(analysis) indicates one or more elements containing interpretations of the element on which the ana attribute appears.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	one or more valid identifiers of one or more interpretive elements (usually <fs> or interp), separated by white space.
Note	When multiple values are given, they may reflect either multiple divergent interpretations of an ambiguous text, or multiple mutually consistent interpretations of the same passage in different contexts.

att.global.facs

att.global.facs groups elements corresponding with all or part of an image, because they contain an alternative representation of it, typically but not necessarily a transcription of it. [11.1. ]

Module transcr

Attributes

facs

(facsimile) points to all or part of an image which corresponds with the content of the element.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace

att.global.linking

att.global.linking defines a set of attributes for hypertextual linking.

Module linking

Attributes

corresp

(corresponds) points to elements that correspond to the current element in some way.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
<group> <text xml:id="t1-g1-t1" xml:lang="mi"> <body xml:id="t1-g1-t1-body1"> <div type="chapter"> <head>He Whakamaramatanga mo te Ture Hoko, Riihi hoki, i nga Whenua Maori, 1876.</head> <p>…</p> </div> </body> </text> <text xml:id="t1-g1-t2" xml:lang="en"> <body xml:id="t1-g1-t2-body1" corresp="#t1-g1-t1-body1"> <div type="chapter"> <head>An Act to regulate the Sale, Letting, and Disposal of Native Lands, 1876.</head> <p>…</p> </div> </body> </text> </group> In this example a group contains two texts, each containing the same document in a different language. The correspondence is indicated using corresp. The language is indicated using `xml:lang`, whose value is inherited; both the tag with the corresp and the tag pointed to by the corresp inherit the value from their immediate parent.

points to the next element of a virtual aggregate of which the current element is part.

Status	Optional
Datatype	`data.pointer`

(previous) points to the previous element of a virtual aggregate of which the current element is part.

Status	Optional
Datatype	`data.pointer`

att.interpLike

att.interpLike provides attributes for elements which represent a formal analysis or interpretation. [17.2. ]

Module tei

Members interp interpGrp

Attributes

Attributes att.responsibility (@cert, @resp)

type

indicates what kind of phenomenon is being noted in the passage.

Status	Recommended
Datatype	`data.enumerated`
Sample values include:	image identifies an image in the passage. character identifies a character associated with the passage. theme identifies a theme in the passage. allusion identifies an allusion to another text.

inst

(instances) points to instances of the analysis or interpretation represented by the current element.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Note	The current element should be an analytic one. The element pointed at should be a textual one.

att.media

att.media provides attributes for specifying display and related properties of external media.

Module tei

Members graphic

Attributes

width

Where the media are displayed, indicates the display width

Status	Optional
Datatype	`data.outputMeasurement`

height

Where the media are displayed, indicates the display height

Status	Optional
Datatype	`data.outputMeasurement`

scale

Where the media are displayed, indicates a scale factor to be applied when generating the desired display size

Status	Optional
Datatype	`data.numeric`

att.milestoneUnit

att.milestoneUnit provides an attribute to indicate the type of section which is changing at a specific milestone. [3.10.3. 2.3.6.3. 2.3.6. ]

Module core

Members milestone

Attributes

unit

provides a conventional name for the kind of section changing at this milestone.

Status	Required
Datatype	`data.enumerated`
Suggested values include:	page physical page breaks (synonymous with the pb element). column column breaks. line line breaks (synonymous with the lb element). book any units termed book, liber, etc. poem individual poems in a collection. canto cantos or other major sections of a poem. speaker changes of speaker or narrator. stanza stanzas within a poem, book, or canto. act acts within a play. scene scenes within a play or act. section sections of any kind. absent passages not present in the reference edition. unnumbered passages present in the text, but not to be included as part of the reference.
<milestone n="23" ed="La" unit="Dreissiger"/> ... <milestone n="24" ed="AV" unit="verse"/> ...
Note	If the milestone marks the beginning of a piece of text not present in the reference edition, the special value absent may be used as the value of unit. The normal interpretation is that the reference edition does not contain the text which follows, until the next milestone tag for the edition in question is encountered. In addition to the values suggested, other terms may be appropriate (e.g. Stephanus for the Stephanus numbers in Plato).

att.naming

att.naming provides attributes common to elements which refer to named persons, places, organizations etc. [3.5.1. 13.3.5. ]

Module tei

Members att.personal [name] author editor pubPlace rs

Attributes

Attributes att.canonical (@key, @ref)

role

may be used to specify further information about the entity referenced by this name, for example the occupation of a person, or the status of a place.

Status	Optional
Datatype	`data.enumerated`

nymRef

(reference to the canonical name) provides a means of locating the canonical form (nym) of the names associated with the object named by the element bearing it.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Note	The value must point directly to one or more XML elements by means of one or more URIs, separated by whitespace. If more than one is supplied, the implication is that the name is associated with several distinct canonical names.

att.personal

att.personal (attributes for components of names usually, but not necessarily, personal names) common attributes for those elements which form part of a name usually, but not necessarily, a personal name. [13.2.1. ]

Module tei

Members name

Attributes

Attributes att.naming (@role, @nymRef) (att.canonical (@key, @ref))

full

indicates whether the name component is given in full, as an abbreviation or simply as an initial.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	yes the name component is spelled out in full. [Default] abb (abbreviated) the name component is given in an abbreviated form. init (initial letter) the name component is indicated only by one initial.

sort

specifies the sort order of the name component in relation to others within the name.

Status	Optional
Datatype	`data.count`
Values	A positive number indicating the sort order.

att.placement

att.placement provides attributes for describing where on the source page or object a textual element appears. [3.4.3. 11.3.1.4. ]

Module tei

Members add figure label note

Attributes

place

specifies where this item is placed

Status	Recommended
Datatype	1–∞ occurrences of `data.enumerated`separated by whitespace
Suggested values include:	below below the line bottom at the foot of the page margin in the margin (left, right, or both) top at the top of the page opposite on the opposite, i.e. facing, page overleaf on the other side of the leaf above above the line end at the end of e.g. chapter or volume. inline within the body of the text. inspace in a predefined space, for example left by an earlier scribe.
<add place="margin">[An addition written in the margin]</add> <add place="bottom opposite">[An addition written at the foot of the current page and also on the facing page]</add>
<note place="bottom">Ibid, p.7</note>

att.pointing

att.pointing defines a set of attributes used by all elements which point to other elements by means of one or more URI references. [1.3.1.1.2. 3.6. ]

Module tei

Members catRef gloss licence note ptr ref term

Attributes

targetLang

specifies the language of the content to be found at the destination referenced by target, using a ‘language tag’ generated according to BCP 47.

Status	Optional
Datatype	`data.language`
Schematron	<sch:rule context="tei:*[not(self::tei:schemaSpec)][@targetLang]"> <sch:assert test="count(@target)">@targetLang can only be used if @target is specified.</sch:assert></sch:rule>
Values	The value must conform to BCP 47. If the value is a private use code (i.e., starts with x- or contains -x-), a language element with a matching value for its ident attribute should be supplied in the TEI header to document this value. Such documentation may also optionally be supplied for non-private-use codes, though these must remain consistent with their (IETF)Internet Engineering Task Force definitions.
<linkGrp xml:id="pol-swh_aln_2.1-linkGrp"> <ptr xml:id="pol-swh_aln_2.1.1-ptr" target="pol/UDHR/text.xml#pol_txt_1-head" type="tuv" targetLang="pl"/> <ptr xml:id="pol-swh_aln_2.1.2-ptr" target="swh/UDHR/text.xml#swh_txt_1-head" type="tuv" targetLang="sw"/> </linkGrp> In the example above, the <linkGrp> combines pointers at parallel fragments of the Universal Declaration of Human Rights: one of them is in Polish, the other in Swahili.

target

specifies the destination of the reference by supplying one or more URI References

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	One or more syntactically valid URI references, separated by whitespace. Because whitespace is used to separate URIs, no whitespace is permitted inside a single URI. If a whitespace character is required in a URI, it should be escaped with the normal mechanism, e.g. `TEI%20Consortium`.

evaluate

specifies the intended meaning when the target of a pointer is itself a pointer.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	all if the element pointed to is itself a pointer, then the target of that pointer will be taken, and so on, until an element is found which is not a pointer. one if the element pointed to is itself a pointer, then its target (whether a pointer or not) is taken as the target of this pointer. none no further evaluation of targets is carried out beyond that needed to find the element specified in the pointer's target.
Note	If no value is given, the application program is responsible for deciding (possibly on the basis of user input) how far to trace a chain of pointers.

att.ranging

att.ranging provides attributes for describing numerical ranges.

Module tei

Members att.dimensions [att.editLike [att.transcriptional [add del] corr date expan gap name reg time unclear] ] num

Attributes

atLeast

gives a minimum estimated value for the approximate measurement.

Status	Optional
Datatype	`data.numeric`

atMost

gives a maximum estimated value for the approximate measurement.

Status	Optional
Datatype	`data.numeric`

min

where the measurement summarizes more than one observation or a range, supplies the minimum value observed.

Status	Optional
Datatype	`data.numeric`

max

where the measurement summarizes more than one observation or a range, supplies the maximum value observed.

Status	Optional
Datatype	`data.numeric`

confidence

specifies the degree of statistical confidence (between zero and one) that a value falls within the range specified by min and max, or the proportion of observed values that fall within that range.

Status	Optional
Datatype	`data.probability`

att.resourced

att.resourced provides attributes by which a resource (such as an externally held media file) may be located.

Module tei

Members graphic

Attributes

url

(uniform resource locator) specifies the URL from which the media concerned may be obtained.

Status	Required
Datatype	`data.pointer`

att.responsibility

att.responsibility provides attributes indicating who is responsible for something asserted by the markup and the degree of certainty associated with it. [3.4. 17.3. 13.1.1. ]

Module tei

Members att.editLike [att.transcriptional [add del] corr date expan gap name reg time unclear] att.interpLike [interp interpGrp] abbr note orig seg sic

Attributes

cert

(certainty) signifies the degree of certainty associated with the intervention or interpretation.

Status	Optional
Datatype	`data.certainty`

resp

(responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	A pointer to an element typically, but not necessarily, in the document header that is associated with a person asserted as responsible for some aspect of the text's creation, transcription, editing, or encoding.

Example

Blessed are the
<choice>
<sic>placemakers</sic>
<corr resp="#editor" cert="high">peacmakers</corr>
</choice>: for they shall be called the children of God.

att.segLike

att.segLike provides attributes for elements used for arbitrary segmentation. [16.3. 17.1. ]

Module tei

Members pc s seg w

Attributes

Attributes att.fragmentable (@part)

function

characterizes the function of the segment.

Status	Optional
Datatype	`data.enumerated`
Values	For a <cl>, may take values such as coordinate, subject, adverbial etc. For a <phr>, such values as subject, predicate etc. may be more appropriate.

att.sortable

att.sortable provides attributes for elements in lists or groups that are sortable, but whose sorting key cannot be derived mechanically from the element content. [1. ]

Module tei

Members bibl idno item list listBibl term

Attributes

sortKey

supplies the sort key for this element in an index, list or group which contains it.

Status	Optional
Datatype	`data.word`
Values	a sequence of characters which, when sorted with the other values, will produced the desired order; specifics of sort key construction are application-dependent.
David's other principal backer, Josiah ha-Kohen <index indexName="NAMES"> <term sortKey="Azarya_Josiah_Kohen">Josiah ha-Kohen b. Azarya</term> </index> b. Azarya, son of one of the last gaons of Sura was David's own first cousin.
Note	The sort key is used to determine the sequence and grouping of entries in an index. Dictionary order often differs from the collation sequence of machine-readable character sets; in English-language dictionaries, an entry for 4-H will often appear alphabetized under ‘fourh’, and McCoy may be alphabetized under‘maccoy’, while A1, A4, and A5 may all appear in numeric order ‘alphabetized’ between ‘a-’ and ‘AA’. The sort key is required if the orthography of the dictionary entry does not suffice to determine its location.

att.source

att.source provides attributes for pointing to the source of a bibliographic reference. [3.3.3. 8.3.4. ]

Module tei

Members q

Attributes

source

(pointer to a bibliographical source reference) provides a pointer to the bibliographical source from which a quotation or citation is drawn.

Status	Optional
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	Must point to one or more bibliographic elements in the TEI header or elsewhere

Example

<p>

As Willard McCarty (<bibl xml:id="mcc_2012">2012, p.2</bibl>)
tells us, <quote source="#mcc_2012">‘Collaboration’ is a
problematic and should be a contested term.</quote>

</p>

Example

<p>

<quote source="#chicago_15_ed">Grammatical theories
are in flux, and the more we learn, the less we
seem to know.</quote>

</p>

<bibl xml:id="chicago_15_ed">
<title level="m">The Chicago Manual of Style</title>,
<edition>15th edition</edition>.
<pubPlace>Chicago</pubPlace>:
<publisher>University of Chicago Press</publisher>
(<date>2003</date>),
<biblScope unit="page">p.147</biblScope>.

</bibl>

att.spanning

att.spanning provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it. [1.3.1. ]

Module tei

Members index lb milestone pb

Attributes

spanTo

indicates the end of a span initiated by the element bearing this attribute.

Status	Optional
Datatype	`data.pointer`
Schematron	The @spanTo attribute must point to an element following the current element <sch:rule context="tei:[@spanTo]"> <sch:assert test="id(substring(@spanTo,2)) and following::[@xml:id=substring(current()/@spanTo,2)]">The element indicated by @spanTo (<sch:value-of select="@spanTo"/>) must follow the current element <sch:name/> </sch:assert></sch:rule>
Values	points to an element following this one in the current document.

Note

The span is defined as running in document order from the start of the content of the pointing element to the end of the content of the element pointed to by the spanTo attribute (if any). If no value is supplied for the attribute, the assumption is that the span is coextensive with the pointing element. If no content is present, the assumption is that the starting point of the span is immediately following the element itself.

att.tableDecoration

att.tableDecoration provides attributes used to decorate rows or cells of a table. [14. ]

Module tei

Members cell row

Attributes

role

indicates the kind of information held in this cell or in each cell of this row.

Status	Optional
Datatype	`data.enumerated`
Suggested values include:	label labelling or descriptive information only. data data values. [Default]
Note	When this attribute is specified on a row, its value is the default for all cells in this row. When specified on a cell, its value overrides any default specified by the role attribute of the parent row element.

rows

indicates the number of rows occupied by this cell or row.

Status	Optional
Datatype	`data.count`
Default	1
Values	A number; a value greater than one indicates that this cell (or row) spans several rows.
Note	Where several cells span several rows, it may be more convenient to use nested tables.

cols

(columns) indicates the number of columns occupied by this cell or row.

Status	Optional
Datatype	`data.count`
Default	1
Values	A number; a value greater than one indicates that this cell or row spans several columns.
Note	Where an initial cell spans an entire row, it may be better treated as a heading.

att.transcriptional

att.transcriptional provides attributes specific to elements encoding authorial or scribal intervention in a text when transcribing manuscript or similar sources. [1. ]

Module tei

Members add del

Attributes

Attributes att.editLike (@evidence, @source, @instant) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max, @confidence)) ) (att.responsibility (@cert, @resp))

hand

signifies the hand of the agent which made the intervention.

Status	Optional
Datatype	`data.pointer`
Values	must refer to a <handNote> element, typically declared in the document header (see section 11.3.2.1. ).

status

indicates the effect of the intervention, for example in the case of a deletion, strikeouts which include too much or too little text, or in the case of an addition, an insertion which duplicates some of the text already present.

Status	Optional
Datatype	`data.enumerated`
Sample values include:	duplicate all of the text indicated as an addition duplicates some text that is in the original, whether the duplication is word-for-word or less exact. duplicate-partial part of the text indicated as an addition duplicates some text that is in the original excessStart some text at the beginning of the deletion is marked as deleted even though it clearly should not be deleted. excessEnd some text at the end of the deletion is marked as deleted even though it clearly should not be deleted. shortStart some text at the beginning of the deletion is not marked as deleted even though it clearly should be. shortEnd some text at the end of the deletion is not marked as deleted even though it clearly should be. partial some text in the deletion is not marked as deleted even though it clearly should be. unremarkable the deletion is not faulty. [Default]
Note	Status information on each deletion is needed rather rarely except in critical editions from authorial manuscripts; status information on additions is even less common. Marking a deletion or addition as faulty is inescapably an interpretive act; the usual test applied in practice is the linguistic acceptability of the text with and without the letters or words in question.

cause

documents the presumed cause for the intervention.

Status	Optional
Datatype	`data.enumerated`
Legal values are:	fix repeated for the purpose of fixation unclear repeated to clarify a previously illegible or badly written text or mark

seq

(sequence) assigns a sequence number related to the order in which the encoded features carrying this attribute are believed to have occurred.

Status	Mandatory when applicable
Datatype	`data.count`

att.translatable

att.translatable provides attributes used to indicate the status of a translatable portion of an ODD document.

Module tei

Members desc gloss

Attributes

versionDate

specifies the date on which the source text was extracted and sent to the translator

Status	Optional
Datatype	`data.temporal.w3c`
Note	The versionDate attribute can be used to determine whether a translation might need to be revisited, by comparing the modification date on the containing file with the versionDate value on the translation. If the file has changed, changelogs can be checked to see whether the source text has been modified since the translation was made.

att.typed

att.typed provides attributes which can be used to classify or subclassify elements in any way. [1.3.1. ]

Module tei

Members add anchor bibl change cit corr date del div figure gloss group head ident label lb lg listBibl milestone name note pb pc ptr ref reg relatedItem rs s seg term text time trailer w

Attributes

type

characterizes the element in some sense, using any convenient classification scheme or typology.

Status	Optional
Datatype	`data.enumerated`
<div type="verse"> <head>Night in Tarras</head> <lg type="stanza"> <l>At evening tramping on the hot white road</l> <l>…</l> </lg> <lg type="stanza"> <l>A wind sprang up from nowhere as the sky</l> <l>…</l> </lg> </div>
Note	The type attribute is present on a number of elements, not all of which are members of att.typed, usually because these elements restrict the possible values for the attribute in a specific way.

subtype

provides a sub-categorization of the element, if needed

Status	Optional
Datatype	`data.enumerated`
Note	The subtype attribute may be used to provide any sub-classification for the element additional to that provided by its type attribute.

Schematron

<sch:rule context="*[@subtype]">
<sch:assert test="@type">The <sch:name/> element should not be categorized in detail with @subtype
unless also categorized in general with @type</sch:assert></sch:rule>

Note

When appropriate, values from an established typology should be used. Alternatively a typology may be defined in the associated TEI header. If values are to be taken from a project-specific list, this should be defined using the <valList> element in the project-specific schema description, as described in 23.3.1.4. .

Schema tei_lite: Macros

data.certainty

data.certainty defines the range of attribute values expressing a degree of certainty.
Module	tei
Used by
Declaration	data.certainty = "high" \| "medium" \| "low" \| "unknown"
Note	Certainty may be expressed by one of the predefined symbolic values high, medium, or low. The value unknown should be used in cases where the encoder does not wish to assert an opinion about the matter. For more precise indication, data.probability may be used instead or in addition.

data.count

data.count defines the range of attribute values used for a non-negative integer value used as a count.
Module	tei
Used by	Element: table/@rows table/@cols
Declaration	data.count = xsd:nonNegativeInteger
Note	Only positive integer values (including zero) are permitted

data.duration.iso

data.duration.iso defines the range of attribute values available for representation of a duration in time using ISO 8601 standard formats
Module	tei
Used by
Declaration	data.duration.iso = token { pattern = "[0-9.,DHMPRSTWYZ/:+\-]+" }
Example	<time dur-iso="PT0,75H">three-quarters of an hour</time>
Example	<date dur-iso="P1,5D">a day and a half</date>
Example	<date dur-iso="P14D">a fortnight</date>
Example	<time dur-iso="PT0.02S">20 ms</time>
Note	A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the last, which may have a decimal component (using either `.` or `,` as the decimal point; the latter is preferred). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator `T` must precede the first ‘time’ number-letter pair. For complete details, see ISO 8601 Data elements and interchange formats — Information interchange — Representation of dates and times.

data.duration.w3c

data.duration.w3c defines the range of attribute values available for representation of a duration in time using W3C datatypes.
Module	tei
Used by
Declaration	data.duration.w3c = xsd:duration
Example	<time dur="PT45M">forty-five minutes</time>
Example	<date dur="P1DT12H">a day and a half</date>
Example	<date dur="P7D">a week</date>
Example	<time dur="PT0.02S">20 ms</time>
Note	A duration is expressed as a sequence of number-letter pairs, preceded by the letter P; the letter gives the unit and may be Y (year), M (month), D (day), H (hour), M (minute), or S (second), in that order. The numbers are all unsigned integers, except for the `S` number, which may have a decimal component (using `.` as the decimal point). If any number is 0, then that number-letter pair may be omitted. If any of the H (hour), M (minute), or S (second) number-letter pairs are present, then the separator `T` must precede the first ‘time’ number-letter pair. For complete details, see the W3C specification.

data.enumerated

data.enumerated defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities.
Module	tei
Used by	Element: abbr/@type att/@scheme availability/@status biblScope/@unit biblScope/@type divGen/@type formula/@notation gap/@agent gi/@scheme idno/@type list/@type num/@type pc/@force pc/@unit q/@type stage/@type teiHeader/@type title/@type title/@level titlePage/@type titlePart/@type unclear/@agent
Declaration	data.enumerated = data.name
Note	Attributes using this datatype must contain a word which follows the rules defining a legal XML name (see http://www.w3.org/TR/REC-xml/#dt-name): for example they cannot include whitespace or begin with digits. Typically, the list of documented possibilities will be provided (or exemplified) by a value list in the associated attribute specification, expressed with a <valList> element.

data.language

data.language defines the range of attribute values used to identify a particular combination of human language and writing system. [6.1. ]
Module	tei
Used by	Element: language/@ident
Declaration	data.language = xsd:language \| ""
Note	The values for this attribute are language ‘tags’ as defined in BCP 47. Currently BCP 47 comprises RFC 4646 and RFC 4647; over time, other IETF documents may succeed these as the best current practice. A ‘language tag’, per BCP 47, is assembled from a sequence of components or subtags separated by the hyphen character (-, U+002D). The tag is made of the following subtags, in the following order. Every subtag except the first is optional. If present, each occurs only once, except the fourth and fifth components (variant and extension), which are repeatable. language The IANA-registered code for the language. This is almost always the same as the ISO 639 2-letter language code if there is one. The list of available registered language subtags can be found at http://www.iana.org/assignments/language-subtag-registry. It is recommended that this code be written in lower case. script The ISO 15924 code for the script. These codes consist of 4 letters, and it is recommended they be written with an initial capital, the other three letters in lower case. The canonical list of codes is maintained by the Unicode Consortium, and is available at http://unicode.org/iso15924/iso15924-codes.html. The IETF recommends this code be omitted unless it is necessary to make a distinction you need. region Either an ISO 3166 country code or a UN M.49 region code that is registered with IANA (not all such codes are registered, e.g. UN codes for economic groupings or codes for countries for which there is already an ISO 3166 2-letter code are not registered). The former consist of 2 letters, and it is recommended they be written in upper case. The list of codes can be found at http://www.iso.org/iso/en/prods-services/iso3166ma/02iso-3166-code-lists/index.html. The latter consist of 3 digits; the list of codes can be found at http://unstats.un.org/unsd/methods/m49/m49.htm. variant An IANA-registered variation. These codes are used to indicate additional, well-recognized variations that define a language or its dialects that are not covered by other available subtags. extension An extension has the format of a single letter followed by a hyphen followed by additional subtags. These exist to allow for future extension to BCP 47, but as of this writing no such extensions are in use. private use An extension that uses the initial subtag of the single letter x (i.e., starts with `x-`) has no meaning except as negotiated among the parties involved. These should be used with great care, since they interfere with the interoperability that use of RFC 4646 is intended to promote. In order for a document that makes use of these subtags to be TEI conformant, a corresponding language element must be present in the TEI header. There are two exceptions to the above format. First, there are language tags in the IANA registry that do not match the above syntax, but are present because they have been ‘grandfathered’ from previous specifications. Second, an entire language tag can consist of only a private use subtag. These tags start with `x-`, and do not need to follow any further rules established by the IETF and endorsed by these Guidelines. Like all language tags that make use of private use subtags, the language in question must be documented in a corresponding language element in the TEI header. Examples include sn Shona zh-TW Taiwanese zh-Hant-HK Chinese written in traditional script as used in Hong Kong en-SL English as spoken in Sierra Leone pl Polish es-MX Spanish as spoken in Mexico es-419 Spanish as spoken in Latin America The W3C Internationalization Activity has published a useful introduction to BCP 47, Language tags in HTML and XML.

data.name

data.name defines the range of attribute values expressed as an XML Name.
Module	tei
Used by	att data.enumerated giElement: index/@indexName
Declaration	data.name = xsd:Name
Note	Attributes using this datatype must contain a single word which follows the rules defining a legal XML name (see http://www.w3.org/TR/REC-xml/#dt-name): for example they cannot include whitespace or begin with digits.

data.namespace

data.namespace defines the range of attribute values used to indicate XML namespaces as defined by the W3C Namespaces in XML Technical Recommendation.
Module	tei
Used by
Declaration	data.namespace = xsd:anyURI
Note	The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax

data.numeric

data.numeric defines the range of attribute values used for numeric values.
Module	tei
Used by	Element: num/@value
Declaration	data.numeric = xsd:double \| token { pattern = "(\-?[\d]+/\-?[\d]+)" } \| xsd:decimal
Note	Any numeric value, represented as a decimal number, in floating point format, or as a ratio. To represent a floating point number, expressed in scientific notation, ‘E notation’, a variant of ‘exponential notation’, may be used. In this format, the value is expressed as two numbers separated by the letter E. The first number, the significand (sometimes called the mantissa) is given in decimal format, while the second is an integer. The value is obtained by multiplying the mantissa by 10 the number of times indicated by the integer. Thus the value represented in decimal notation as 1000.0 might be represented in scientific notation as 10E3. A value expressed as a ratio is represented by two integer values separated by a solidus (/) character. Thus, the value represented in decimal notation as 0.5 might be represented as a ratio by the string 1/2.

data.outputMeasurement

data.outputMeasurement defines a range of values for use in specifying the size of an object that is intended for display.
Module	tei
Used by
Declaration	data.outputMeasurement = token { pattern = "[\-+]?\d+(\.\d+)?(%\|cm\|mm\|in\|pt\|pc\|px\|em\|ex\|gd\|rem\|vw\|vh\|vm)" }
Example	<figure> <head>The TEI Logo</head> <figDesc>Stylized yellow angle brackets with the letters <mentioned>TEI</mentioned> in between and <mentioned>text encoding initiative</mentioned> underneath, all on a white background.</figDesc> <graphic height="600px" width="600px" url="http://www.tei-c.org/logos/TEI-600.jpg"/> </figure>
Note	These values map directly onto the values used by XSL-FO and CSS. For definitions of the units see those specifications; at the time of this writing the most complete list is in the CSS3 working draft.

data.pattern

data.pattern (regular expression pattern) defines attribute values which are expressed as a regular expression.
Module	tei
Used by
Declaration	data.pattern = token
Note	A regular expression, often called a pattern, is an expression that describes a set of strings. They are usually used to give a concise description of a set, without having to list all elements. For example, the set containing the three strings Handel, Händel, and Haendel can be described by the pattern `H(ä\|ae?)ndel` (or alternatively, it is said that the pattern `H(ä\|ae?)ndel` matches each of the three strings) Wikipedia

data.pointer

data.pointer defines the range of attribute values used to provide a single URI, absolute or relative, pointing to some other resource, either within the current document or elsewhere.
Module	tei
Used by	Element: catRef/@scheme change/@target classCode/@scheme gap/@hand keywords/@scheme note/@targetEnd relatedItem/@target unclear/@hand w/@lemmaRef
Declaration	data.pointer = xsd:anyURI
Note	The range of syntactically valid values is defined by RFC 3986 Uniform Resource Identifier (URI): Generic Syntax. Note that the values themselves are encoded using RFC 3987 Internationalized Resource Identifiers (IRIs) mapping to URIs. For example, `https://secure.wikimedia.org/wikipedia/en/wiki/%` is encoded as `https://secure.wikimedia.org/wikipedia/en/wiki/%25` while `http://موقع.وزارة-الاتصالات.مصر/` is encoded as `http://xn--4gbrim.xn----rmckbbajlc6dj7bxne2c.xn--wgbh1c/`

data.probability

data.probability defines the range of attribute values expressing a probability.
Module	tei
Used by
Declaration	data.probability = xsd:double { minInclusive = "0" maxInclusive = "1" }
Note	Probability is expressed as a real number between 0 and 1; 0 representing certainly false and 1 representing certainly true.

data.replacement

data.replacement defines attribute values which contain a replacement template.
Module	tei
Used by
Declaration	data.replacement = text

data.temporal.w3c

data.temporal.w3c defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes specification.
Module	tei
Used by	Element: docDate/@when
Declaration	data.temporal.w3c = xsd:date \| xsd:gYear \| xsd:gMonth \| xsd:gDay \| xsd:gYearMonth \| xsd:gMonthDay \| xsd:time \| xsd:dateTime
Note	If it is likely that the value used is to be compared with another, then a time zone indicator should always be included, and only the dateTime representation should be used.

data.text

data.text defines the range of attribute values used to express some kind of identifying string as a single sequence of unicode characters possibly including whitespace.
Module	tei
Used by	Element: w/@lemma
Declaration	data.text = string
Note	Attributes using this datatype must contain a single ‘token’ in which whitespace and other punctuation characters are permitted.

data.truthValue

data.truthValue defines the range of attribute values used to express a truth value.
Module	tei
Used by	Element: note/@anchored pc/@pre
Declaration	data.truthValue = xsd:boolean
Note	The possible values of this datatype are 1 or true, or 0 or false.
Note	This datatype applies only for cases where uncertainty is inappropriate; if the attribute concerned may have a value other than true or false, e.g. unknown, or inapplicable, it should have the extended version of this datatype: data.xTruthValue.

data.version

data.version defines the range of attribute values which may be used to specify a TEI version number.
Module	tei
Used by	Element: teiCorpus/@version
Declaration	data.version = token { pattern = "[\d]+(\.[\d]+){0,2}" }
Note	The value of this attribute follows the pattern specified by the Unicode consortium for its version number (http://unicode.org/version). A version number contains digits and fullstop characters only. The first number supplied identifies the major version number. A second and third number, for minor and sub-minor version numbers, may also be supplied.

data.versionNumber

data.versionNumber defines the range of attribute values used for version numbers.
Module	tei
Used by
Declaration	data.versionNumber = token { pattern = "[\d]+[a-z][\d](\.[\d]+[a-z][\d]){0,3}" }

data.word

data.word defines the range of attribute values expressed as a single word or token.
Module	tei
Used by	Element: biblScope/@from biblScope/@to code/@lang gap/@reason unclear/@reason
Declaration	data.word = token { pattern = "(\p{L}\|\p{N}\|\p{P}\|\p{S})+" }
Note	Attributes using this datatype must contain a single ‘word’ which contains only letters, digits, punctuation characters, or symbols: thus it cannot include whitespace.

data.xmlName

data.xmlName defines attribute values which contain an XML name.
Module	tei
Used by
Declaration	data.xmlName = xsd:NCName
Note	The rules defining an XML name form a part of the XML Specification.

data.xTruthValue

data.xTruthValue (extended truth value) defines the range of attribute values used to express a truth value which may be unknown.
Module	tei
Used by
Declaration	data.xTruthValue = xsd:boolean \| "unknown" \| "inapplicable"
Note	In cases where where uncertainty is inappropriate, use the datatype data.TruthValue.

macro.limitedContent

macro.limitedContent (paragraph content) defines the content of prose elements that are not used for transcription of extant materials. [1.3. ]
Module	tei
Used by	desc figDesc
Declaration	macro.limitedContent = ( text \| model.limitedPhrase \| model.inter )*

macro.paraContent

macro.paraContent (paragraph content) defines the content of paragraphs and similar elements. [1.3. ]
Module	tei
Used by	add corr del docEdition emph hi imprimatur l orig p ref reg seg sic signed title titlePart unclear
Declaration	macro.paraContent = ( text \| model.gLike \| model.phrase \| model.inter \| model.global \| lg )*

macro.phraseSeq

macro.phraseSeq (phrase sequence) defines a sequence of character data and phrase-level elements. [1.4.1. ]
Module	tei
Used by	abbr addrLine author biblScope dateline distributor docAuthor docDate edition editor eg expan extent foreign gloss label mentioned name num pubPlace publisher rs s salute soCalled speaker term trailer
Declaration	macro.phraseSeq = ( text \| model.gLike \| model.phrase \| model.global )*

macro.phraseSeq.limited

macro.phraseSeq.limited (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents. [1.4.1. ]
Module	tei
Used by	authority classCode funder language principal resp sponsor
Declaration	macro.phraseSeq.limited = ( text \| model.limitedPhrase \| model.global )*

macro.specialPara

macro.specialPara ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements. [1.3. ]
Module	tei
Used by	cell change item licence note q stage
Declaration	macro.specialPara = ( text \| model.gLike \| model.phrase \| model.inter \| model.divPart \| model.global )*

Table of contents

4.1 Les éléments marquant la division du texte

4.2 Les têtes et fins de chapitre

4.3 La prose, les vers et le théâtre

6.1 Les changements de police de caractères, etc.

6.2 Les citations et les éléments associés

6.3 Les mots étrangers ou les expressions

8.1 Les références croisées simples

8.2 Le renvoi à d’autres documents

8.3 Les types spéciaux de liens

9.1 La correction et la normalisation

9.2 Les omissions, les suppressions et les ajouts

9.3 Les abréviations et leurs formes développées

10.1 Les dénominations et les expressions référentielles

10.2 Les dates et les heures

10.3 Les nombres

15.1 Les phrases « orthographiques »

15.2 Les mots et la ponctuation

15.3 Les éléments généraux d’interprétation

16.1 Les éléments supplémentaires pour les documents techniques

16.2 Les sections générées

16.3 La génération d’index

16.4 Les adresses

18.1 Les Parties liminaires

18.1.1 La page titre

18.1.2 Différents types de liminaires

18.2 Les Annexes

18.2.1 Les divisions structurelles des Annexes

19.1 La description du fichier

19.1.1 La mention du titre

19.1.2 La mention de l’édition

19.1.3 La mention relative à la taille

19.1.4 La mention de la publication

19.1.5 La mention des séries et des notes

19.1.6 La description de la source

19.2 La description de l’encodage

19.2.1 Les descriptions du projet et des échantillons

19.2.2 Les déclarations éditoriales

19.2.3 Les déclarations relatives au système de référence et à la classification

19.3 La description du profil

19.4 La description des modifications

Schema tei_lite: Elements

<abbr>

<add>

<address>

<addrLine>

<anchor>

<argument>

<att>

<author>

<authority>

<availability>

<back>

<bibl>

<biblScope>

<body>

<byline>

<catDesc>

<category>

<catRef>

<cell>

<change>

<choice>

<cit>

<classCode>

<classDecl>

<closer>

<code>

<corr>

<creation>

<date>

<dateline>

<del>

<desc>

<distributor>

<div>

<divGen>

<docAuthor>

<docDate>

<docEdition>