11 Representation of Primary Sources
Table of contents
- 11.1 Digital Facsimiles
- 11.2 Scope of Transcriptions
- 11.3 Altered, Corrected, and Erroneous Texts
- 11.4 Hands and Responsibility
- 11.5 Damage and Conjecture
- 11.6 Aspects of Layout
- 11.7 Headers, Footers, and Similar Matter
- 11.8 Other Primary Source Features not Covered in these Guidelines
- 11.9 Module for Transcription of Primary Sources
This chapter defines a module intended for use in the representation of primary sources, such as manuscripts or other written materials. Section 11.1 Digital Facsimiles provides elements for the encoding of digital facsimiles or images of such materials, while the remainder of the chapter discusses ways of encoding detailed transcriptions of such materials. It is expected that this module will also be useful in the preparation of critical editions, but the module defined here is distinct from that defined in chapter 12 Critical Apparatus, and may be used independently of it. Detailed metadata relating to primary sources of any kind may be recorded using the elements defined by the manuscript description module discussed in chapter 10 Manuscript Description, but again the present module may be used independently if such data is not required.
It should be noted that, as elsewhere in these Guidelines, this chapter places more emphasis on the problems of representing the textual components of a document than on those relating to the description of the document's physical characteristics such as the carrier medium or physical construction. These aspects, of particular importance in codicology and the bibliographic study of incunables, are touched on in the chapter on Manuscript Description (10 Manuscript Description) and also form the subject of ongoing work in the TEI Physical Bibliography workgroup.
Although this chapter discusses manuscript materials more frequently than other forms of written text, most of the recommendations presented are equally applicable mutatis mutandis in the encoding of printed matter or indeed any form of written source, including monumental inscriptions. Similarly, where in the following descriptions terms such as ‘scribe’, ‘author’, ‘editor’, ‘annotator’ or ‘corrector’ are used, these may be re-interpreted in terms more appropriate to the medium being transcribed. In printed material, for example, the ‘compositor’ plays a role analogous to the ‘scribe’, while in an authorial manuscript, the author and the scribe are the same person.
11.1 Digital FacsimilesTEI: Digital Facsimiles¶
These Guidelines are mostly concerned with the preparation of digital texts, in which a pre-existing text is transcribed or otherwise converted into character form, and marked up in XML. However, it is also very common practice to make a different form of ‘digital text’, which is instead composed of digital images of the original source, typically one per page, or other written surface. We call such a resource a digital facsimile. A digital facsimile may, in the simplest case, just consist of a collection of images, with some metadata to identify them and the source materials portrayed. It may sometimes contain a variety of images of the same source pages, for example of different resolutions, or of different kinds. Such a collection may form part of any kind of document, for example a commentary of a codicological or paeleographic nature, where there is a need to align explanatory text with image data. And it may also be complemented by a transcribed or encoded version of the original source, which may be linked to the page images. In this section we present elements designed to support these various possibilities and discuss the associated mechanisms provided by these Guidelines.
- att.global.facs groups elements corresponding with all or part of an image,
because they contain an alternative representation of it, typically
but not necessarily a transcription of it.
facs (facsimile) points to all or part of an image which corresponds with the content of the element.
<teiHeader>
<!--...-->
</teiHeader>
<text>
<pb facs="page1.png"/>
<!-- text contained on page 1 is encoded here -->
<pb facs="page2.png"/>
<!-- text contained on page 2 is encoded here -->
</text>
</TEI>
The recommended approach to encoding facsimiles is instead to use the facs attribute in conjunction with the elements facsimile, surface, and zone, which are also provided by this module. These elements make it possible to accommodate multiple images of each page, as well as to record arbitrary planar coordinates of textual elements on any kind of written surface and to link such elements with digital facsimile images of them. Typical applications include the provision of full text search in ‘digital facsimile editions’, and ways of annotating graphics, for example so as to identify individuals appearing in a group portraits and link them to data about the person represented.
- facsimile contains a representation of some written source in the form of a set of images rather than as transcribed or encoded text.
- surface defines a written surface in terms of a rectangular
coordinate space, optionally grouping one or more graphic representations of
that space, and rectangular zones of interest
within it.
start points to an element which encodes the starting position of the text corresponding to the inscribed part of the surface. - zone defines a rectangular area contained within a surface element.
- a TEI Header and a text element
- a TEI Header and a facsimile element
- a TEI Header, a facsimile element, and a text element
Like the text element, a facsimile element may also contain an optional front or back element, used in the same way as described in sections 4.5 Front Matter and 4.7 Back Matter.
<graphic url="page1.png"/>
<graphic url="page2.png"/>
<graphic url="page3.png"/>
<graphic url="page4.png"/>
</facsimile>
<graphic url="page1.png"/>
<surface>
<graphic url="page2-highRes.png"/>
<graphic url="page2-lowRes.png"/>
</surface>
<graphic url="page3.png"/>
<graphic url="page4.png"/>
</facsimile>
The surface element provides a way of indicating that the two images of page2 represent the same physical surface within the source material. A surface might be a sheet of paper or parchment, a face of a monument, a billboard, a membrane of a scroll, or indeed any two-dimensional surface, of any size.
- att.coordinated elements which can be positioned within a two dimensional
coordinate system.
ulx gives the x coordinate value for the upper left corner of a rectangular space. uly gives the y coordinate value for the upper left corner of a rectangular space. lrx gives the x coordinate value for the lower right corner of a rectangular space. lry gives the y coordinate value for the lower right corner of a rectangular space.
The same coordinate space is used for a surface and for all of its child elements.34 It may be most convenient to derive a coordinate space from a digital image of the surface in question such that each pixel in the image corresponds with a whole number of units (typically 1) in the coordinate space. In other cases it may be more convenient to use units such as millimetres; in neither case is any specific mapping to the physical dimensions of the object represented implied.
Each surface can contain one or more zone elements, each of which represents a rectangular region or bounding box defined in terms of the same coordinate space as that of its parent surface element. This provides a unit of analysis which may be used to define any rectangular region of interest, such as a detail or illustration, or some part of the surface which is to be aligned with a particular text element. The att.coordinated attributes listed above are also used to supply the coordinates of a zone.
As we have seen, a surface will usually correspond with the whole of a written surface. A zone, by contrast, defines any arbitrary rectangular area of interest using the same coordinate system. It might be bigger or smaller than its parent surface, or might overlap its boundaries. The only constraint is that it must be defined using the same coordinate system.
When an image of some kind is supplied within either a zone or a surface, the implication is that the whole of the image represents the zone or surface containing it. In the simple case therefore, we might imagine a surface defining a page, within which there is a graphic representing the whole of that page, and a number of zones defining parts of the page, each with its own graphic, each representing a part of the page. If however one of those graphics actually represents an area larger than the page (for example to include a binding or the surface of a desk on which the page rests), then it will be enclosed by a zone with coordinates larger than those of the parent surface.
Note that this mechanism does not provide any way of addressing a non-rectangular area, nor of coping with distortions introduced by perspective or parallax; if this is needed, the more powerful mechanisms provided by the Standard Vector Graphics (SVG) language should be used to define an overlay, as further discussed in 16.4.3 A Three-way Alignment.

<surface
ulx="50"
uly="20"
lrx="400"
lry="280">
<!-- ... -->
</surface>
</facsimile>
<surface
ulx="50"
uly="20"
lrx="400"
lry="280">
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
</surface>
</facsimile>
If desired, the binaryObject element described in 3.9 Graphics and other non-textual components (or any other element from the model.graphicLike class) may be used instead of a graphic element.
<surface
ulx="50"
uly="20"
lrx="210"
lry="280">
<desc>left hand page</desc>
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
</surface>
<surface
ulx="240"
uly="25"
lrx="400"
lry="280">
<desc>right hand page</desc>
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
</surface>
</facsimile>
<surface
ulx="50"
uly="20"
lrx="210"
lry="280">
<desc>Left hand page</desc>
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
<zone
ulx="90"
uly="40"
lrx="200"
lry="225">
<desc>Written part of left hand page</desc>
</zone>
</surface>
</facsimile>
<surface
ulx="0"
uly="0"
lrx="200"
lry="300">
<graphic url="Bovelles-49r.png"/>
</surface>
</facsimile>

<surface
ulx="0"
uly="0"
lrx="200"
lry="300">
<graphic url="Bovelles-49r.png"/>
<zone
ulx="25"
uly="25"
lrx="180"
lry="60">
<desc>contains the title</desc>
</zone>
<zone
ulx="28"
uly="75"
lrx="175"
lry="178"/>
<!-- contains the paragraph in italics -->
<zone
ulx="105"
uly="76"
lrx="175"
lry="160"/>
<!-- contains the figure -->
<zone
ulx="45"
uly="125"
lrx="60"
lry="130"/>
<!-- contains the word "pendans" -->
</surface>
</facsimile>
ulx="105"
uly="76"
lrx="175"
lry="160">
<graphic url="Bovelles49r-detail.png"/>
</zone>
<surface
ulx="0"
uly="0"
lrx="200"
lry="300">
<zone
xml:id="B49r"
ulx="0"
uly="0"
lrx="200"
lry="300">
<graphic url="Bovelles-49r.png"/>
</zone>
<zone
ulx="105"
uly="76"
lrx="175"
lry="160">
<graphic url="Bovelles49r-detail.png"/>
</zone>
<zone
xml:id="B49rHead"
ulx="25"
uly="25"
lrx="180"
lry="60"/>
<!-- contains the title -->
<zone
xml:id="B49rPara2"
ulx="28"
uly="75"
lrx="175"
lry="178"/>
<!-- contains the paragraph in italics -->
<zone
xml:id="B49rFig1"
ulx="105"
uly="76"
lrx="175"
lry="160"/>
<!-- contains the figure -->
<zone
xml:id="B49rW457"
ulx="45"
uly="125"
lrx="60"
lry="130"/>
<!-- contains the word "pendans" -->
</surface>
</facsimile>
<fw>De Geometrie 49</fw>
<head facs="#B49rHead">DU SON ET ACCORD DES CLOCHES ET <lb/> des alleures des chevaulx,
chariotz & charges, des fontaines:& <lb/> encyclie du monde,
& de la dimension du corps humain.</head>
<head>Chapitre septiesme</head>
<div n="1">
<p>Le son & accord des cloches pendans en ung mesme <lb/> axe, est
faict en contraires parties.</p>
<p rend="it" facs="#B49rPara2">LEs cloches ont quasi fi<lb/>gures de rondes
pyra<lb/>mides imperfaictes & <lb/> irregulieres: & leur
accord se <lb/> fait par reigle geometrique. Com<lb/>me si les deux
cloches C & D <lb/> sont <w facs="#B49rW457">pendans</w> à ung
mesme axe <lb/> ou essieu A B: je dis que leur ac<lb/>cord se fera en
co<ex>n</ex>traires parties<lb/> co<ex>m</ex>me voyez icy
figuré. Car qua<ex>n</ex>d <lb/> lune sera en hault, laultre
declinera embas. Aultrement si elles decli<lb/>nent toutes deux
ensembles en une mesme partie, elles seront discord, <lb/> & sera
leur sonnerie mal plaisante à oyr.<figure facs="#B49rFig1">
<graphic url="Bovelles49r-detail.png"/>
</figure>
</p>
</div>
Further discussion of the encoding choices made in the above transcription is provided in the remainder of this chapter.
<surface start="#PB49R">
<graphic url="Bovelles-49r.png"/>
</surface>
</facsimile>
<text>
<!-- ... -->
<pb xml:id="PB49R"/>
<fw>De Geometrie 49</fw>
<!-- ... -->
</text>
11.2 Scope of TranscriptionsTEI: Scope of Transcriptions¶
- first, methods of recording editorial or other alterations to the text, such as expansion of abbreviations, corrections, conjectures, etc. (section 11.3 Altered, Corrected, and Erroneous Texts)
- then, methods of describing important extra-linguistic phenomena in the source: unusual spaces, lines, page and line breaks, change of manuscript hand, etc. (section 11.4 Hands and Responsibility)
- finally, a method of recording material such as running heads, catch-words, and the like (section 11.7 Headers, Footers, and Similar Matter)
These recommendations are not intended to meet every transcriptional circumstance likely to be faced by any scholar. Rather, they should be regarded as a base which can be elaborated if necessary by different scholars in different disciplines.
As a rule, all elements which may be used in the course of a transcription of a single witness may also be used in a critical apparatus, i.e. within the elements proposed in chapter 12 Critical Apparatus. This can generally be achieved by nesting a particular reading containing tagged elements from a particular witness within the rdg element in an app structure.
Just as a critical apparatus may contain transcriptional elements within its record of variant readings in various witnesses, one may record variant readings in an individual witness by use of the apparatus mechanisms app and rdg. This is discussed in section 12.3 Using Apparatus Elements in Transcriptions.
11.3 Altered, Corrected, and Erroneous TextsTEI: Altered, Corrected, and Erroneous Texts¶
In the detailed transcription of any source, it may prove necessary to record various types of actual or potential alteration of the text: expansion of abbreviations, correction of the text (either by author, scribe, or later hand, or by previous or current editors or scholars), addition, deletion, or substitution of material, and the like. The sections below describe how such phenomena may be encoded using either elements defined in the core module (defined in chapter 3 Elements Available in All TEI Documents) or specialized elements available only when the module described in this chapter is available.
11.3.1 Core elements for Transcriptional WorkTEI: Core elements for Transcriptional Work¶
- abbr (abbreviation) contains an abbreviation of any sort.
- add (addition) contains letters, words, or phrases inserted in the text by an author, scribe, annotator, or corrector.
- choice groups a number of alternative encodings for the same point in a text.
- corr (correction) contains the correct form of a passage apparently erroneous in the copy text.
- del (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, annotator, or corrector.
- expan (expansion) contains the expansion of an abbreviation.
- gap indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible.
- sic (latin for thus or so) contains text reproduced although apparently incorrect or inaccurate.
- att.editLike provides attributes describing the nature of a encoded scholarly intervention or interpretation of any kind.
cert (certainty) signifies the degree of certainty associated with the intervention or interpretation. resp (responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber. source contains a list of one or more pointers indicating the sources which support the given reading. - att.typed provides attributes which can be used to classify or subclassify elements in any way.
type characterizes the element in some sense, using any convenient classification scheme or typology. subtype provides a sub-categorization of the element, if needed
The following sections describe how the core elements just named may be used in the transcription of primary source materials.
11.3.2 Abbreviation and ExpansionTEI: Abbreviation and Expansion¶
The writing of manuscripts by hand lends itself to the use of abbreviation to shorten scribal labour. Commonly occurring letters, groups of letters, words, or even whole phrases, may be represented by significant marks. This phenomenon of manuscript abbreviation is so widespread and so various that no taxonomy of it is here attempted. Instead, methods are shown which allow abbreviations to be encoded using the core elements mentioned above.
A manuscript abbreviation may be viewed in two ways. One may transcribe it as a particular sequence of letters or marks upon the page: thus, a ‘p with a bar through the descender’, a ‘superscript hook’, a ‘macron’. One may also interpret the abbreviation in terms of the letter or letters it is seen as standing for: thus, ‘per’, ‘re’, ‘n’. Both of these views are supported by these Guidelines.
In many cases the glyph found in the manuscript source also exists in the Unicode character set: for example the common Latin brevigraph ⁊, standing for et and often known as the ‘Tironian et’ can be directly represented in any XML document as the Unicode character with code point U+204A (see further Character References and vi.1. Language identification). In cases where it does not, these Guidelines recommend use of the g element provided by the gaiji module described in chapter 5 Representation of Non-standard Characters and Glyphs. This module allows the encoder great flexibility both in processing and in documenting non-standard characters or glyphs, including the ability to provide detailed documentation and images for them.
ladder
<!-- elsewhere -->
<charDecl>
<char xml:id="b-er">
<!-- definition for the er brevigraph -->
</char>
<char xml:id="b-per">
<!-- definition for the per brevigraph -->
</char>
</charDecl>
<abbr>
<g ref="#b-per">per</g>sone
</abbr>
...
<expan>persone</expan> ...
<abbr>eu<g ref="#b-er">er</g>y</abbr>
<expan>euery</expan>
</choice>
- ex (editorial expansion) contains a sequence of letters added by an editor or transcriber when expanding an abbreviation.
- am (abbreviation marker) contains a sequence of letters or signs present in an abbreviation which are omitted or replaced in the expanded form of the abbreviation.
<g ref="#b-er"/>
</am>y</abbr>
<abbr>
<am>
<g ref="#b-per"/>
</am>sone
</abbr> ...
<expan>
<ex>per</ex>sone
</expan> ...
<am>
<g ref="#b-er"/>
</am>
<ex>er</ex>
</choice>y
<choice>
<am>
<g ref="#b-per"/>
</am>
<ex>per</ex>
</choice>sone ...
As implied in the preceding discussion, making decisions about which of these various methods of representing abbreviation to use will form an important part of an encoder's practice. As a rule, the abbr and am elements should be preferred where it is wished to signify that the content of the element is an abbreviation, without necessarily indicating what the abbreviation may stand for. The ex and expan elements should be used where it is wished to signify that the content of the element is not present in the source but has been supplied by the transcriber, without necessarily indicating the abbreviation used in the original. The decision as to which course of action is appropriate may vary from abbreviation to abbreviation; there is no requirement that the one system be used throughout a transcription, although doing so will generally simplify processing. The choice is likely to be a matter of editorial policy. If the highest priority is to transcribe the text literatim, while indicating the presence of abbreviations, the choice will be to use abbr or am throughout. If the highest priority is to present a reading transcription, while indicating that some letters or words are not actually present in the original, the choice will be to use ex or expan throughout.
plural ending (-es, -is, -ys>) but the singular <hi rend="it">good</hi> was used with the meaning <q>property</q>,
<q>wealth</q>, at this time (v. examples quoted in OED, sb. Good,
C. 7, b, c, d and 8 spec.)</note>
good<ex resp="#mp" cert="high">e</ex> I was welbeloued
<resp>Editorial emendations</resp>
<name>Malcom Parkes</name>
</respStmt>
<choice>
<sic>good<abbr>ɽ</abbr>
</sic>
<expan resp="#mp" cert="high">good<ex>e</ex>
</expan>
</choice>
I was welbeloued
If more than one expansion for the same abbreviation is to be recorded, multiple notes may be supplied. It may also be appropriate to use the markup for critical apparatus; an example is given in section 12.3 Using Apparatus Elements in Transcriptions.
11.3.3 Correction and ConjectureTEI: Correction and Conjecture¶
has been modified by James to begin ‘But One must ...’, without the inital capital O having been reduced to lowercase. This non-standard orthography could be recorded thus:One must have lived longer with this system, to appreciate its advantages.
must have lived ...
have lived ...
<choice>
<sic>One</sic>
<corr>one</corr>
</choice> must have lived
...
et <choice>
<sic>angues</sic>
<corr>augens</corr>
</choice>.
Note that the corr element is used to provide a corrected form which is not present in the source; in the case of a correction made in the source itself, whether scribal, authorial, or by some other hand, the add, del, and subst elements described in 11.3.4 Additions and Deletions should be used.
create nothing <supplied>we</supplied> develope.
As with expan and abbr, the choice as to whether to record simply that there is an apparent error, or simply that a correction has been applied, or to record both possible readings within a choice element is left to the encoder. The decision is likely to be a matter of editorial policy, which might be applied consistently throughout or decided case by case. If the highest priority is to present an uncorrected transcription while noting perceived errors in the original, the choice will typically be to use only sic throughout. If the highest priority is to present a reading transcription, while indicating that perceived errors in the original have been corrected, the choice will be to use only corr throughout.
Further information may be attached to instances of these elements by the note element and resp and cert attributes. Instances of these elements may also be classified according to any convenient typology using the type attribute.
Were membres maad, of generacioun
And of so parfit wis a
<choice xml:id="corr117">
<sic>wight</sic>
<corr>wright</corr>
</choice>
ywroght?
<!-- ... -->
<note target="#corr117">This emendation of the Hengwrt copy text,
based on a Latin source and on the reading of three late
and usually unauthoritative manuscripts, was proposed
by E. Talbot Donaldson in <bibl>
<title>Speculum</title> 40 (1965)
626–33.</bibl>
</note>
<!-- somewhere in the header ... --><name xml:id="ETD">E Talbot Donaldson</name>
<!-- ... -->
And of so parfit wis a
<choice>
<sic>wight</sic>
<corr resp="#ETD" cert="medium">wright</corr>
</choice>
ywroght?
<sic>mens</sic>
<corr>iners</corr>
</choice> que nutu dei
gesta sunt ... unde esset uiriliter
<choice xml:id="sic-2">
<corr>uegetata</corr>
<sic>negata</sic>
</choice>
graphically what the scribe should be copying but which does not make
sense in the context.</note>
<choice>
<sic>mens</sic>
<corr type="graphSubs">iners</corr>
</choice> que nutu dei
gesta sunt ... unde esset uiriliter
<choice>
<corr type="graphSubs">uegetata</corr>
<sic>negata</sic>
</choice>
<choice>
<sic>mens</sic>
<corr type="graphSubs">iners</corr>
<corr type="reversal">inres</corr>
</choice> que nutu dei
gesta sunt ...
<p>The following codes are used to categorise corrections identified
in this transcription:
<list type="gloss">
<label>graphSubs</label>
<item>Substitution of a more familiar word which resembles
graphically what the scribe should be copying but which does not make
sense in the context.</item>
<!-- ... -->
</list>
</p>
</correction>
For a given project, it may well be desirable to limit the possible values for the type or subtype attributes automatically. This is easily done but requires customization of the TEI system using techniques described in 23.2 Personalization and Customization, in particular 23.2.1.4 Modification of Attribute and Attribute Value Lists, which should be consulted for further information on this topic.
<choice>
<sic>wight</sic>
<corr resp="#mp" source="#Gg">wyf</corr>
</choice>
ywroght?
<msIdentifier>
<settlement>Cambridge</settlement>
<repository>University Library</repository>
<idno>Gg.1. 27</idno>
</msIdentifier>
<!-- further description of the manuscript here -->
</msDesc>
parfit wis a
<app>
<rdg wit="#Hg">wight</rdg>
<rdg wit="#Ln #Ry2 #Ld">wright</rdg>
<rdg wit="#Gg">wyf</rdg>
</app>
parfit wis a
<app>
<rdg wit="#Hg">wight</rdg>
<rdg wit="#Ln #Ry2 #Ld">
<corr resp="#ETD">wright</corr>
</rdg>
<rdg wit="#Gg">
<corr resp="#mp">wyf</corr>
</rdg>
</app>
Like the resp attribute, the cert attribute may be used with both corr and rdg elements. When used on the rdg element, these attributes indicate confidence in and responsibility for identifying the reading within the sources specified; when used on the corr element they indicate confidence in and responsibility for the use of the reading to correct the base text. If no other source is indicated (either by the source attribute, or by the wit attribute of a parent rdg), the reading supplied within a corr has been provided by the person indicated by the resp attribute.
If it is desired to express aspects of certainty and responsibility for some other aspect of the use of these elements, then the mechanisms discussed in chapter 21 Certainty and Responsibility may be found useful. See also 11.4.2 Hand, Responsibility, and Certainty Attributes for further discussion of the issues of certainty and responsibility in the context of transcription.
11.3.4 Additions and DeletionsTEI: Additions and Deletions¶
- add (addition) contains letters, words, or phrases inserted in the text by an author, scribe, annotator, or corrector.
- addSpan/ (added span of text) marks the beginning of a longer sequence of text added by an author, scribe, annotator or corrector (see also <add>).
- del (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, annotator, or corrector.
- delSpan/ (deleted span of text) marks the beginning of a longer sequence of text deleted, marked as deleted, or otherwise signaled as superfluous or spurious by an author, scribe, annotator, or corrector.
- att.spanning provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it.
spanTo indicates the end of a span initiated by the element bearing this attribute.
- att.transcriptional provides attributes specific to elements encoding authorial or
scribal intervention in a text when
transcribing manuscript or similar sources.
seq (sequence) assigns a sequence number related to the order in which the encoded features carrying this attribute are believed to have occurred. status indicates the effect of the intervention, for example in the case of a deletion, strikeouts which include too much or too little text, or in the case of an addition, an insertion which duplicates some of the text already present. hand signifies the hand of the agent which made the intervention.
sight. Others — and here is one of them — <add hand="#mb">do
ever</add> improve by recognition ....
<handNote xml:id="mb">Max Beerbohm holograph</handNote>
...
<handNote xml:id="dhl">D H Lawrence holograph</handNote>
If deletions are classified systematically, the type attribute may be useful to indicate the classification; when they are classified by the manner in which they were effected, or by their appearance, however, this will lead to a certain arbitrariness in deciding whether to use the type or the rend attribute to hold the information. In general, it is recommended that the rend attribute be used for description of the appearance or method of deletion, and that the type attribute be reserved for higher level or more abstract classifications.

precedents <del hand="#RG">in the</del>: current,
obsolete, <add hand="#RG" place="supralinear">cant,</add>
cataphretic and nonce-words are all included.
