TEI Tite — A recommendation for off-site text encoding

Formal specification

Schema tei_tite: changed components

att.declarable

att.declarable provides attributes for those elements in the TEI Header which may be independently selected by means of the special purpose decls attribute.
Module	tei
Members	bibl listBibl
Attributes	Attributes

att.editLike

att.editLike provides attributes describing the nature of a encoded scholarly intervention or interpretation of any kind.

Module tei

Members gap unclear date time

Attributes

Attributes att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max)) att.responsibility (@cert, @resp)

source

contains a list of one or more pointers indicating sources supporting the given intervention or interpretation.

Status	Mandatory when applicable
Datatype	1–∞ occurrences of `data.pointer`separated by whitespace
Values	A space-delimited series of sigla; each sigil should correspond to a witness or witness group and occur as the value of the xml:id attribute on a <witness> or <msDesc> element elsewhere in the document.

att.global

att.global provides attributes common to all elements in the TEI encoding scheme.

Module tei

Members p foreign hi q cit desc gap unclear name email address addrLine num measureGrp date time abbr ptr ref list item label head note graphic milestone pb lb cb author editor respStmt resp title publisher biblScope pubPlace bibl listBibl relatedItem l lg sp speaker stage text body group floatingText div1 div2 div3 div4 div5 div6 div7 trailer byline dateline argument epigraph opener closer salute signed postscript titlePage docTitle titlePart docAuthor docEdition docImprint docDate front back table row cell formula figure ab seg g b i ul sub sup smcap cols ornament

Attributes

xml:id

(identifier) provides a unique identifier for the element bearing the attribute.

Status	Optional
Datatype	`xsd:ID`
Values	any valid XML identifier.
Note	The xml:id attribute may be used to specify a canonical reference for an element; see section ??.

(number) gives a number (or other label) for an element, which is not necessarily unique within the document.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	the value may contain only letters, digits, punctuation characters, or symbols: it may not contain whitespace or word separating characters. It need not be restricted to numbers.
Note	The n attribute may be used to specify the numbering of chapters, sections, list items, etc.; it may also be used in the specification of a standard reference system for the text.

xml:lang

(language) indicates the language of the element content using a ‘tag’ generated according to BCP 47

Status	Optional
Datatype	`data.language`
Values	The value must conform to BCP 47. If the value is a private use code (i.e., starts with `x-` or contains `-x-`) it should, and if not it may, match the value of an ident attribute of a <language> element supplied in the TEI Header of the current document.
Note	If no value is specified for xml:lang, the xml:lang value for the immediately enclosing element is inherited; for this reason, a value should always be specified on the outermost element (<TEI>).

rend

(rendition) indicates how the element in question was rendered or presented in the source text.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	may contain any number of tokens, each of which may contain letters, punctuation marks, or symbols, but not word-separating characters.
<head rend="align(center) case(allcaps)"> <lb/>To The <lb/>Duchesse <lb/>of <lb/>Newcastle, <lb/>On Her <lb/> <hi rend="case(mixed)">New Blazing-World</hi>. </head>
Note	These Guidelines make no binding recommendations for the values of the rend attribute; the characteristics of visual presentation vary too much from text to text and the decision to record or ignore individual characteristics varies too much from project to project. Some potentially useful conventions are noted from time to time at appropriate points in the Guidelines.

xml:base

provides a base URI reference with which applications can resolve relative URI references into absolute URI references.

Status	Optional
Datatype	`data.pointer`
Values	any syntactically valid URI reference.
<div type="bibl"> <head>Bibliography</head> <listBibl xml:base="http://www.lib.ucdavis.edu/BWRP/Works/"> <bibl n="1"> <author> <name>Landon, Letitia Elizabeth</name> </author> <ref target="LandLVowOf.sgm"> <title>The Vow of the Peacock</title> </ref> </bibl> <bibl n="2"> <author> <name>Compton, Margaret Clephane</name> </author> <ref target="NortMIrene.sgm"> <title>Irene, a Poem in Six Cantos</title> </ref> </bibl> <bibl n="3"> <author> <name>Taylor, Jane</name> </author> <ref target="TaylJEssay.sgm"> <title>Essays in Rhyme on Morals and Manners</title> </ref> </bibl> </listBibl> </div>

xml:space

signals an intention about how white space should be managed by applications.

Status	Optional
Legal values are:	default the processor should treat white space according to the default XML white space handling rules preserve the processor should preserve unchanged any and all white space in the source
Note	The XML specification provides further guidance on the use of this attribute.

att.typed

att.typed provides attributes which can be used to classify or subclassify elements in any way.

Module tei

Members cit name measureGrp date time ptr ref head note milestone pb lb cb bibl listBibl relatedItem lg text floatingText div1 div2 div3 div4 div5 div6 div7 figure ab seg g

Attributes

type

characterizes the element in some sense, using any convenient classification scheme or typology.

Status	Optional
Datatype	`data.enumerated`

[http://www.tei-c.org/ns/tite/1.0]

<b> (bold) for capturing typographical feature: bold glyphs.
Module	derived-module-tei_tite
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)
Used by	model.hiLike
Contained by	core: abbr addrLine author bibl biblScope date editor email foreign head hi item l label name note num p pubPlace publisher q ref speaker stage time title unclear derived-module-tei_tite: b i smcap sub sup ul figures: cell linking: ab seg textstructure: byline closer dateline docAuthor docDate docEdition docImprint opener salute signed titlePart trailer
May contain	core: abbr address bibl cb cit date desc email foreign gap graphic hi label lb list listBibl measureGrp milestone name note num pb ptr q ref stage time title unclear derived-module-tei_tite: b cols i ornament smcap sub sup ul figures: figure formula table gaiji: g linking: seg
Declaration	element b { att.global.attributes, macro.paraContent }

<cols> [http://www.tei-c.org/ns/tite/1.0]

<cols> (columns) with the ‘n’ attribute (denoting new number of columns) is used to mark where a document changes columnar layout.

Module derived-module-tei_tite

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)

indicates the edition or version in which the change in columnar layout is located at this point

Status	Optional
Datatype	`data.code`

Used by

model.milestoneLike

Contained by

core: abbr addrLine address author bibl biblScope cit date editor email foreign head hi item l label lg list listBibl name note num p pubPlace publisher q ref resp sp speaker stage time title unclear

derived-module-tei_tite: b i smcap sub sup ul

figures: cell figure table

linking: ab seg

May contain Empty element

Declaration

element cols
{
   att.global.attributes,
   attribute [http://www.tei-c.org/ns/tite/1.0]ed { data.code }?,
   empty
}

<gap>

<gap> (gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible. http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CO.html#COEDADD

Module core

Attributes

Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space) att.editLike (@source) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max)) ) (att.responsibility (@cert, @resp))

reason

gives the reason for omission. Sample values include sampling, inaudible, irrelevant, cancelled.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	any short indication of the reason for the omission.

Used by

model.global.edit

Contained by

core: abbr addrLine address author bibl biblScope cit date editor email foreign head hi item l label lg list name note num p pubPlace publisher q ref resp sp speaker stage time title unclear

derived-module-tei_tite: b i smcap sub sup ul

figures: cell figure table

linking: ab seg

May contain

core: desc

Declaration

element gap
{
   attribute reason { list { data.word, data.word* } }?,
   att.global.attributes,
   att.editLike.attribute.source,
   att.dimensions.attributes,
   model.glossLike*
}

Example

Example

Note

The gap, unclear, and <del> core tag elements may be closely allied in use with the <damage> and <supplied> elements, available when using the additional tagset for transcription of primary sources. See section ?? for discussion of which element is appropriate for which circumstance.

[http://www.tei-c.org/ns/tite/1.0]

<i> (italics) for capturing typographical feature: italicized glyphs.
Module	derived-module-tei_tite
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)
Used by	model.hiLike
Contained by	core: abbr addrLine author bibl biblScope date editor email foreign head hi item l label name note num p pubPlace publisher q ref speaker stage time title unclear derived-module-tei_tite: b i smcap sub sup ul figures: cell linking: ab seg textstructure: byline closer dateline docAuthor docDate docEdition docImprint opener salute signed titlePart trailer
May contain	core: abbr address bibl cb cit date desc email foreign gap graphic hi label lb list listBibl measureGrp milestone name note num pb ptr q ref stage time title unclear derived-module-tei_tite: b cols i ornament smcap sub sup ul figures: figure formula table gaiji: g linking: seg
Declaration	element i { att.global.attributes, macro.paraContent }

<ornament> [http://www.tei-c.org/ns/tite/1.0]

<ornament> for capturing typographical feature: printer's ornament, horizontal line, strings of asterisks or periods, etc, indicating an informal division that does not call for a new <div> element. If a horizontal rule or printer's ornament, use appropriate rend attribute and leave the element empy; if the ornament can be represented with characters, include these in the element.
Module	derived-module-tei_tite
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)
Used by	model.inter model.titlepagePart
Contained by	core: desc head hi item l note p q ref stage title unclear derived-module-tei_tite: b i smcap sub sup ul figures: cell linking: ab seg textstructure: argument body div1 div2 div3 div4 div5 div6 div7 docEdition epigraph postscript titlePage titlePart
May contain	Character data only
Declaration	element ornament { att.global.attributes, text }

<smcap> [http://www.tei-c.org/ns/tite/1.0]

<smcap> (smallcaps) for capturing typographical feature: glyphs in small capitals.
Module	derived-module-tei_tite
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)
Used by	model.hiLike
Contained by	core: abbr addrLine author bibl biblScope date editor email foreign head hi item l label name note num p pubPlace publisher q ref speaker stage time title unclear derived-module-tei_tite: b i smcap sub sup ul figures: cell linking: ab seg textstructure: byline closer dateline docAuthor docDate docEdition docImprint opener salute signed titlePart trailer
May contain	core: abbr address bibl cb cit date desc email foreign gap graphic hi label lb list listBibl measureGrp milestone name note num pb ptr q ref stage time title unclear derived-module-tei_tite: b cols i ornament smcap sub sup ul figures: figure formula table gaiji: g linking: seg
Declaration	element smcap { att.global.attributes, macro.paraContent }

[http://www.tei-c.org/ns/tite/1.0]

<sub> (subscript) for capturing typographical feature: subscript glyphs.
Module	derived-module-tei_tite
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)
Used by	model.hiLike
Contained by	core: abbr addrLine author bibl biblScope date editor email foreign head hi item l label name note num p pubPlace publisher q ref speaker stage time title unclear derived-module-tei_tite: b i smcap sub sup ul figures: cell linking: ab seg textstructure: byline closer dateline docAuthor docDate docEdition docImprint opener salute signed titlePart trailer
May contain	core: abbr address bibl cb cit date desc email foreign gap graphic hi label lb list listBibl measureGrp milestone name note num pb ptr q ref stage time title unclear derived-module-tei_tite: b cols i ornament smcap sub sup ul figures: figure formula table gaiji: g linking: seg
Declaration	element sub { att.global.attributes, macro.paraContent }

[http://www.tei-c.org/ns/tite/1.0]

<sup> (superscript) for capturing typographical feature: superscript glyphs.
Module	derived-module-tei_tite
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)
Used by	model.hiLike
Contained by	core: abbr addrLine author bibl biblScope date editor email foreign head hi item l label name note num p pubPlace publisher q ref speaker stage time title unclear derived-module-tei_tite: b i smcap sub sup ul figures: cell linking: ab seg textstructure: byline closer dateline docAuthor docDate docEdition docImprint opener salute signed titlePart trailer
May contain	core: abbr address bibl cb cit date desc email foreign gap graphic hi label lb list listBibl measureGrp milestone name note num pb ptr q ref stage time title unclear derived-module-tei_tite: b cols i ornament smcap sub sup ul figures: figure formula table gaiji: g linking: seg
Declaration	element sup { att.global.attributes, macro.paraContent }

<ul> [http://www.tei-c.org/ns/tite/1.0]

<ul> (underline) for capturing typographical feature: underlined glyphs.
Module	derived-module-tei_tite
Attributes	Attributes att.global (@xml:id, @n, @xml:lang, @rend, @xml:base, @xml:space)
Used by	model.hiLike
Contained by	core: abbr addrLine author bibl biblScope date editor email foreign head hi item l label name note num p pubPlace publisher q ref speaker stage time title unclear derived-module-tei_tite: b i smcap sub sup ul figures: cell linking: ab seg textstructure: byline closer dateline docAuthor docDate docEdition docImprint opener salute signed titlePart trailer
May contain	core: abbr address bibl cb cit date desc email foreign gap graphic hi label lb list listBibl measureGrp milestone name note num pb ptr q ref stage time title unclear derived-module-tei_tite: b cols i ornament smcap sub sup ul figures: figure formula table gaiji: g linking: seg
Declaration	element ul { att.global.attributes, macro.paraContent }

<unclear>

<unclear> contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source. http://www.tei-c.org/release/doc/tei-p5-doc/en/html/PH.html#PHDA http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CO.html#COEDADD

Module core

Attributes

reason

indicates why the material is hard to transcribe.

Status	Optional
Datatype	1–∞ occurrences of `data.word`separated by whitespace
Values	one or more words describing the difficulty, e.g. faded, background noise, passing truck, illegible, eccentric ductus.
<div> <head>Rx</head> <p>500 mg <unclear reason="illegible">placebo</unclear> </p> </div>

Used by

model.choicePart model.pPart.transcriptional

Contained by

core: abbr addrLine author bibl biblScope date editor email foreign head hi item l label name note num p pubPlace publisher q ref speaker stage time title unclear

derived-module-tei_tite: b i smcap sub sup ul

figures: cell

linking: ab seg

textstructure: byline closer dateline docAuthor docDate docEdition docImprint opener salute signed titlePart trailer

May contain

core: abbr address bibl cb cit date desc email foreign gap graphic hi label lb list listBibl measureGrp milestone name note num pb ptr q ref stage time title unclear

derived-module-tei_tite: b cols i ornament smcap sub sup ul

figures: figure formula table

gaiji: g

linking: seg

Declaration

element unclear
{
   attribute reason { list { data.word, data.word* } }?,
   att.global.attributes,
   att.editLike.attribute.source,
   att.dimensions.attributes,
   macro.paraContent
}

Example

and from time to time invited in like manner
his att<unclear>ention</unclear>

Here the last few letters of the word are hard to read.

Example

...and then <unclear reason="background-noise">Nathalie</unclear> said ...

Note

The same element is used for all cases of uncertainty in the transcription of element content, whether for written or spoken material. For other aspects of certainty, uncertainty, and reliability of tagging and transcription, see chapter ??.

The <damage>, gap, <del>, unclear and <supplied> elements may be closely allied in use. See section ?? for discussion of which element is appropriate for which circumstance.

Schema tei_tite: unchanged components

ab: (anonymous block) contains any arbitrary component-level unit of text, acting as an anonymous container for phrase or inter level elements analogous to, but without the semantic baggage of, a paragraph.

abbr: (abbreviation) contains an abbreviation of any sort.

addrLine: (address line) contains one line of a postal address.

address: contains a postal address, for example of a publisher, an organization, or an individual.

argument: A formal list or prose description of the topics addressed by a subdivision of a text.

att.ascribed: provides attributes for elements representing speech or action that can be ascribed to a specific individual.

att.breaking: provides an attribute to indicate whether or not the element concerned is considered to mark the end of an orthographic token in the same way as whitespace.

att.canonical: provides attributes which can be used to associate a representation such as a name or title with canonical information about the object being named or referenced.

att.datable: provides attributes for normalization of elements that contain dates, times, or datable events.

att.datable.w3c: provides attributes for normalization of elements that contain datable events using the W3C datatypes.

att.declaring: provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element.

att.dimensions: provides attributes for describing the size of physical objects.

att.internetMedia: provides attributes for specifying the type of a computer resource using a standard taxonomy.

att.measurement: provides attributes to represent a regularized or normalized measurement.

att.naming: provides attributes common to elements which refer to named persons, places, organizations etc.

att.placement: provides attributes for describing where on the source page or object a textual element appears.

att.pointing: defines a set of attributes used by all elements which point to other elements by means of one or more URI references.

att.ranging: provides attributes for describing numerical ranges.

att.responsibility: provides attributes indicating who is responsible for something asserted by the markup and the degree of certainty associated with it.

att.sourced: provides attributes identifying the source edition from which some encoded feature derives.

att.spanning: provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it.

att.tableDecoration: provides attributes used to decorate rows or cells of a table.

att.translatable: provides attributes used to indicate the status of a translatable portion of an ODD document.

author: in a bibliographic reference, contains the name(s) of the author(s), personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority.

back: (back matter) contains any appendixes, etc. following the main part of a text.

bibl: (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.

biblScope: (scope of citation) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work.

body: (text body) contains the whole body of a single unitary text, excluding any front or back matter.

byline: contains the primary statement of responsibility given for a work on its title page or at the head or end of the work.

cb: (column break) marks the boundary between one column of a text and the next in a standard reference system.

cell: contains one cell of a table.

cit: (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example.

closer: groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter.

data.certainty: defines the range of attribute values expressing a degree of certainty.

data.code: defines the range of attribute values expressing a coded value by means of a pointer to some other element which contains a definition for it.

data.count: defines the range of attribute values used for a non-negative integer value used as a count.

data.duration.w3c: defines the range of attribute values available for representation of a duration in time using W3C datatypes.

data.enumerated: defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities.

data.key: defines the range of attribute values expressing a coded value by means of an arbitrary identifier, typically taken from a set of externally-defined possibilities.

data.language: defines the range of attribute values used to identify a particular combination of human language and writing system.

data.name: defines the range of attribute values expressed as an XML Name.

data.numeric: defines the range of attribute values used for numeric values.

data.outputMeasurement: defines a range of values for use in specifying the size of an object that is intended for display on the web.

data.pointer: defines the range of attribute values used to provide a single URI pointer to any other resource, either within the current document or elsewhere.

data.temporal.w3c: defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes specification.

data.truthValue: defines the range of attribute values used to express a truth value.

data.word: defines the range of attribute values expressed as a single word or token.

date: contains a date in any format.

dateline: contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer.

desc: (description) contains a brief description of the object documented by its parent element, including its intended usage, purpose, or application where this is appropriate.

div1: (level-1 text division) contains a first-level subdivision of the front, body, or back of a text.

div2: (level-2 text division) contains a second-level subdivision of the front, body, or back of a text.

div3: (level-3 text division) contains a third-level subdivision of the front, body, or back of a text.

div4: (level-4 text division) contains a fourth-level subdivision of the front, body, or back of a text.

div5: (level-5 text division) contains a fifth-level subdivision of the front, body, or back of a text.

div6: (level-6 text division) contains a sixth-level subdivision of the front, body, or back of a text.

div7: (level-7 text division) contains the smallest possible subdivision of the front, body or back of a text, larger than a paragraph.

docAuthor: (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline).

docDate: (document date) contains the date of a document, as given (usually) on a title page.

docEdition: (document edition) contains an edition statement as presented on a title page of a document.

docImprint: (document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page.

docTitle: (document title) contains the title of a document, including all its constituents, as given on a title page.

editor: secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc.

email: (electronic mail address) contains an e-mail address identifying a location to which e-mail messages can be delivered.

epigraph: contains a quotation, anonymous or attributed, appearing at the start of a section or chapter, or on a title page.

figure: groups elements representing or containing graphic information such as an illustration or figure.

floatingText: contains a single text of any kind, whether unitary or composite, which interrupts the text containing it at any point and after which the surrounding text resumes.

foreign: (foreign) identifies a word or phrase as belonging to some language other than that of the surrounding text.

formula: contains a mathematical or other formula.

front: (front matter) contains any prefatory matter (headers, title page, prefaces, dedications, etc.) found at the start of a document, before the main body.

g: (character or glyph) represents a non-standard character or glyph.

graphic: indicates the location of an inline graphic, illustration, or figure.

group: contains the body of a composite text, grouping together a sequence of distinct texts (or groups of such texts) which are regarded as a unit for some purpose, for example the collected works of an author, a sequence of prose essays, etc.

head: (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc.

hi: (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made.

item: contains one component of a list.

l: (verse line) contains a single, possibly incomplete, line of verse.

label: contains the label associated with an item in a list; in glossaries, marks the term being defined.

lb: (line break) marks the start of a new (typographic) line in some edition or version of a text.

lg: (line group) contains a group of verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc.

list: (list) contains any sequence of items organized as a list.

listBibl: (citation list) contains a list of bibliographic citations of any kind.

macro.anyXML: defines a content model within which any XML elements are permitted

macro.limitedContent: (paragraph content) defines the content of prose elements that are not used for transcription of extant materials.

macro.paraContent: (paragraph content) defines the content of paragraphs and similar elements.

macro.phraseSeq: (phrase sequence) defines a sequence of character data and phrase-level elements.

macro.phraseSeq.limited: (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents.

macro.specialPara: ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements.

measureGrp: (measure group) contains a group of dimensional specifications which relate to the same object, for example the height and width of a manuscript page.

milestone: marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element.

model.addrPart: groups elements such as names or postal codes which may appear as part of a postal address.

model.addressLike: groups elements used to represent a postal or e-mail address.

model.biblLike: groups elements containing a bibliographic description.

model.biblPart: groups elements which represent components of a bibliographic description.

model.choicePart: groups elements (other than <choice> itself) which can be used within a <choice> alternation.

model.common: groups common chunk- and inter-level elements.

model.dateLike: groups elements containing temporal expressions.

model.div1Like: groups top-level structural divisions.

model.div2Like: groups second-level structural divisions.

model.div3Like: groups third-level structural divisions.

model.div4Like: groups fourth-level structural divisions.

model.div5Like: groups fifth-level structural divisions.

model.div6Like: groups sixth-level structural divisions.

model.div7Like: groups seventh-level structural divisions.

model.divBottom: groups elements appearing at the end of a text division.

model.divBottomPart: groups elements which can occur only at the end of a text division.

model.divGenLike: groups elements used to represent a structural division which is generated rather than explicitly present in the source.

model.divLike: groups elements used to represent un-numbered generic structural divisions.

model.divPart: groups paragraph-level elements appearing directly within divisions.

model.divTop: groups elements appearing at the beginning of a text division.

model.divTopPart: groups elements which can occur only at the beginning of a text division.

model.divWrapper: groups elements which can appear at either top or bottom of a textual division.

model.egLike: groups elements containing examples or illustrations.

model.emphLike: groups phrase-level elements which are typographically distinct and to which a specific function can be attributed.

model.entryPart: groups elements appearing at any level within a dictionary entry.

model.entryPart.top: groups high level elements within a structured dictionary entry

model.frontPart: groups elements which appear at the level of divisions within front or back matter.

model.gLike: groups elements used to represent individual non-Unicode characters or glyphs.

model.global: groups elements which may appear at any point within a TEI text.

model.global.edit: groups globally available elements which perform a specifically editorial function.

model.glossLike: groups elements which provide an alternative name, explanation, or description for any markup construct.

model.graphicLike: groups elements containing images, formulae, and similar objects.

model.headLike: groups elements used to provide a title or heading at the start of a text division.

model.hiLike: groups phrase-level elements which are typographically distinct but to which no specific function can be attributed.

model.highlighted: groups phrase-level elements which are typographically distinct.

model.imprintPart: groups the bibliographic elements which occur inside imprints.

model.inter: groups elements which can appear either within or between paragraph-like elements.

model.lLike: groups elements representing metrical components such as verse lines.

model.labelLike: groups elements used to gloss or explain other parts of a document.

model.limitedPhrase: groups phrase-level elements excluding those elements primarily intended for transcription of existing sources.

model.listLike: groups list-like elements.

model.measureLike: groups elements which denote a number, a quantity, a measurement, or similar piece of text that conveys some numerical meaning.

model.milestoneLike: groups milestone-style elements used to represent reference systems.

model.msItemPart: groups elements which can appear within a manuscript item description.

model.msQuoteLike: groups elements which represent passages such as titles quoted from a manuscript as a part of its description.

model.nameLike: groups elements which name or refer to a person, place, or organization.

model.nameLike.agent: groups elements which contain names of individuals or corporate bodies.

model.noteLike: groups globally-available note-like elements.

model.pLike: groups paragraph-like elements.

model.pLike.front: groups paragraph-like elements which can occur as direct constituents of front matter.

model.pPart.data: groups phrase-level elements containing names, dates, numbers, measures, and similar data.

model.pPart.edit: groups phrase-level elements for simple editorial correction and transcription.

model.pPart.editorial: groups phrase-level elements for simple editorial interventions that may be useful both in transcribing and in authoring.

model.pPart.transcriptional: groups phrase-level elements used for editorial transcription of pre-existing source materials.

model.personPart: groups elements which form part of the description of a person.

model.phrase: groups elements which can occur at the level of individual words or phrases.

model.ptrLike: groups elements used for purposes of location and reference.

model.publicationStmtPart: groups elements which may appear within the <publicationStmt> element of the TEI Header.

model.qLike: groups elements related to highlighting which can appear either within or between chunk-level elements.

model.quoteLike: groups elements used to directly contain quotations.

model.respLike: groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element.

model.segLike: groups elements used for arbitrary segmentation.

model.stageLike: groups elements containing stage directions or similar things defined by the module for performance texts.

model.titlepagePart: groups elements which can occur as direct constituents of a title page, such as docTitle, docAuthor, docImprint, or epigraph.

name: (name, proper noun) contains a proper noun or noun phrase.

note: contains a note or annotation.

num: (number) contains a number, written in any form.

opener: groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter.

p: (paragraph) marks paragraphs in prose.

pb: (page break) marks the boundary between one page of a text and the next in a standard reference system.

postscript: contains a postscript, e.g. to a letter.

ptr: (pointer) defines a pointer to another location.

pubPlace: (publication place) contains the name of the place where a bibliographic item was published.

publisher: provides the name of the organization responsible for the publication or distribution of a bibliographic item.

q: (separated from the surrounding text with quotation marks) contains material which is marked as (ostensibly) being somehow different than the surrounding text, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used.

ref: (reference) defines a reference to another location, possibly modified by additional text or comment.

relatedItem: contains or references some other bibliographic item which is related to the present one in some specified manner, for example as a constituent or alternative version of it.

resp: (responsibility) contains a phrase describing the nature of a person's intellectual responsibility.

respStmt: (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply.

row: contains one row of a table.

salute: (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc.

seg: (arbitrary segment) represents any segmentation of text below the ‘chunk’ level.

signed: (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text.

sp: (speech) An individual speech in a performance text, or a passage presented as such in a prose or verse text.

speaker: A specialized form of heading or label, giving the name of one or more speakers in a dramatic text or fragment.

stage: (stage direction) contains any kind of stage direction within a dramatic text or fragment.

table: contains text displayed in tabular form, in rows and columns.

text: contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample.

time: contains a phrase defining a time of day in any format.

title: contains a title for any kind of work.

titlePage: (title page) contains the title page of a text, appearing within the front or back matter.

titlePart: contains a subsection or division of the title of a work, as indicated on a title page.

trailer: contains a closing title or footer appearing at the end of a division of a text.

Table of contents

1 Introduction

2 General Requirements

2.1 What to Capture

2.2 End-of-line Hyphens

2.3 Character Encoding

2.4 Accuracy and Verification

2.5 Documenting the Encoding Process

3 Global Text Structure

3.1 TEI Tite text structure

3.2 Groups of Texts

3.3 Structural Divisions

3.3.1 False Indicators

3.4 Front and Back Matter

4 Types of Text

4.1 Letters

4.2 Verse

4.3 Drama

4.4 Newspapers

5 Block-level Features

5.1 Block Quotations

5.2 Figures

5.3 Tables and Lists

5.4 Notes

5.5 ‘divWrapper’ Elements

5.6 Uncertain Blocks

6 Phrase-level Features

6.1 Typographical Changes

6.2 Phrase-level Quotation

6.3 Alignment and Indentation

6.4 Uncertain Segments

6.5 Unknown Glyphs

7 Reference Systems

TEI Tite and TEI Text Encoding in Libraries Guidelines

Acknowledgments

Formal specification

Schema tei_tite: changed components

att.declarable

att.editLike

att.global

att.typed

<b> [http://www.tei-c.org/ns/tite/1.0]

<cols> [http://www.tei-c.org/ns/tite/1.0]

<gap>

<i> [http://www.tei-c.org/ns/tite/1.0]

<ornament> [http://www.tei-c.org/ns/tite/1.0]

<smcap> [http://www.tei-c.org/ns/tite/1.0]

<sub> [http://www.tei-c.org/ns/tite/1.0]

<sup> [http://www.tei-c.org/ns/tite/1.0]

<ul> [http://www.tei-c.org/ns/tite/1.0]

<unclear>

Schema tei_tite: unchanged components