Written as the principal product of an independent study under
This electronic document is the original, but it primarily synthesizes
work done at the
and represented in their documents , respectively.
This document specifies how TEI Tite should be applied. Its organizing model is roughly the structure of a TEI document itself, and it proceeds from high-level features to low, starting with general requirements, text structure, directions on when to group texts, considerations about type of text (genre and format), continuing down to instructions on marking phrase-level features, reference systems, and so forth. In its original ODD (one document does-it-all) format, this document can generate everything necessary for working in TEI Tite: both documentation (this Tite-specific prose as well as the full technical documentation for each of its elements) and schemas in either W3C Schema, RELAX NG, or XML DTD. Software utilities, including the Roma web tool, can generate these.
Tite-encoded documents are TEI documents, and TEI Tite, with the
exception of convenience elements (
Tite can be used to encode printed prose, poetry, drama, newspapers, and anything else which can be described with the basic TEI building-blocks of divisions, paragraphs, line groups, and speeches.
In this documentation,
All printed material should be captured: all text (that is, printed characters) should be transcribed and the presence of graphical items or other non-transcribable elements should be indicated with markup.
A distinction should be maintained in the electronic transcription
between end-of-line or
Characters should be encoded in UTF-8. For characters not easily input from the keyboard, use hexadecimal numeric entities (e.g. é, the small latin e with acute accent, is represented as é).
The standard for accuracy of transcription should be at least 99.99% (1 error in 10,000 characters). The sample size for verification will be 5% of the total text.
Almost surely, difficult encoding situations will arise whose resolution may not be covered by this documentation or the TEI Guidelines. In such cases, it is important to document the markup choices that are made. To this end each encoded file should be accompanied by a document with such notes. These notes should reference features of a document that seem remarkable to encoders and how these were handled by encoders.
In TEI Tite,
The
Tite omits the
A document should be encoded as a group of texts only when each
member of the group contains its own front or back matter (most
often, a separate title page). In this case the
In cases where a document appears to contain a group of texts but the
above condition is not met, encode each unit as a (numbered)
Tite uses numbered divisions:
The
When a heading is present, encode it with the
The
A
Another potential false indication of a new structural division is an
Front and back matter should be encoded with the
Information on the verso of the title page should be included as well
(after a
Common items to encode in front and back matter -- and therefore common
front
back
Tite is equipped to support basic encoding of several types of text: in terms of genre, it supports prose, verse, and drama, and in terms of format, it supports books, newspapers, pamphlets, and other similar printed material. Tite has special elements for letters, verse, drama, and newspapers.
All verse should be encoded within at least one
Each line of verse should be encoded with the
go,
do say,
some say, "No."indent(1)
, indent(2)
,
etc.) to make clear levels of indentation.
The standard TEI elements for drama should be used:
Scenes and acts should be encoded as appropriately nested type="castlist".
Prologues and epilogues can be treated as
Tite includes the elements
Use the
If present, transcribe all quotation marks or other delimiters inside
the
Use the
Tables and lists are encoded as in the TEI Guidelines, but note the following.
If a cell in a table is a heading or a label, set the
If unsure about whether a structure is best encoded as a list or table, record it as a table only if it would not be properly understood without tabular layout.
Lists should be encoded as either sequences of
Both the reference to the note in the running text and the note itself
must be encoded. Use
When encoding the note itself with the
Transcribe the note directly after it is referenced in the document. In
the case of notes without explicit reference (pointed to with
Elements that can appear at the beginning and end of structural
divisions, such as "I have sworn upon the altar of God eternal hostility against every
form of tyranny over the mind of man."
In rare cases where the logical identity of a block-level element is
hard to discern, use the TEI element
The
There are six elements in Tite that capture specific typographical
features:
These mark the physical change, and are agnostic about a logical
motivation for it. There are two exceptions to this approach, however:
marking foreign words and titles. In the case of foreign words, use the
If there is a typographical feature not covered by the above elements,
the TEI
For passages set off by quotation marks or another delimeter, use the
If the alignment of an element seems remarkable, set the element's
To indicate level of indentation (often in verse), use numerical arguments
to indent
, as in
The
Alternately, when a passage of text is for some reason too hard to
read, use the
For cases in which it is unknown which character a given glyph
corresponds to, mark the glyph with the
Encode page breaks (top.
If marking column breaks, follow the same rules as for page breaks.
Column breaks are imagined to appear at the
If line breaks are to be captured, use the
As of May 2009, the TEI Special Interest Group (SIG) on Libraries is in the process of revising its
levels of encodingbased on depth of markup applied. A draft of this document is available at the Libraries SIG's wiki. Because the levels of encoding provide a tremendously useful common set of terms, it's helpful to situate TEI Tite according to them.
Mapped to GBP levels, TEI Tite would sit between three and four: it
requires use of all the elements from level three plus additional ones,
but requires fewer elements than level four. Relative to level three,
Simple Analysis,
Tite
Because Tite is closer to it, it's level four (Basic Content Analysis
) that provides the most useful
comparison. The folowing items represent instances where Tite is
As of this point, it seems that bringing Tite-encoded documents up to GBP
level four would simply require additional application of markup, not significant
reworking of markup, and in that way Tite seems compatible with the GBP.
Not mentioned above is one key but purposeful incompatibility: Tite's lack of
a header. A TEI header must be added, and the root
The TEI Tite is simply a synthesis of work done at the
and represented in their documents respectively. Many thanks to the institutions and individuals responsible
for sharing their experience and expertise for the benefit of the TEI community at
large.
Also, thank you to members of the TEI Special Interest Group on Libraries who provided very valuable corrections and suggestions.
nattribute (denoting new number of columns) is used to mark where a document changes columnar layout.