egXML
indicates the person, or group of people, to whom the element content is ascribed.
provides an externally-defined means of identifying the entity (or entities) being
named, using a coded value of some kind.
(reference) provides an explicit means of locating a full definition for the entity being named by
means of one or more URIs.
gives a minimum estimated value for the approximate measurement.
(\-?[\d]+/\-?[\d]+)
gives a maximum estimated value for the approximate measurement.
(\-?[\d]+/\-?[\d]+)
where the measurement summarizes more than one observation
or a range, supplies the minimum value
observed.
(\-?[\d]+/\-?[\d]+)
where the measurement summarizes more than one observation
or a range, supplies the maximum value
observed.
(\-?[\d]+/\-?[\d]+)
names the unit used for the measurement
Suggested values include: 1] cm(centimetres) ; 2] mm(millimetres) ; 3] in(inches) ; 4] lines; 5] chars(characters)
cm
(centimetres)
mm
(millimetres)
in
(inches)
lines
lines of text
chars
(characters) characters of text
specifies the length in the units specified
(\-?[\d]+/\-?[\d]+)
indicates the size of the object concerned using a project-specific vocabulary combining
quantity and units in a single string of words.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
characterizes the precision of the values specified by the other attributes.
high
medium
low
unknown
where the measurement summarizes more than one observation, specifies the applicability
of this measurement.
Sample values include: 1] all; 2] most; 3] range
indicates whether or not the element
bearing this attribute should be considered to mark the end of
an orthographic token in the same way as whitespace.
supplies a pointer to some location defining a named
period of time within which the datable item is understood to
have occurred.
supplies the value of the date or time in a standard form,
e.g. yyyy-mm-dd.
specifies the earliest possible date for the event in
standard form, e.g. yyyy-mm-dd.
specifies the latest possible date for the event in
standard form, e.g. yyyy-mm-dd.
indicates the starting point of the period in standard form, e.g. yyyy-mm-dd.
indicates the ending point of the period in standard
form, e.g. yyyy-mm-dd.
indicates whether or not this element is selected by default when
its parent is selected.
true
This element is selected if its parent is selected
false
This element can only be selected explicitly, unless it is the
only one of its kind, in which case it is selected if its parent is selected.
describes the status of a document either currently or, when
associated with a dated element, at the time indicated.
Sample values include: 1] approved; 2] candidate; 3] cleared; 4] deprecated; 5] draft; 6] embargoed; 7] expired; 8] frozen; 9] galley; 10] proposed; 11] published; 12] recommendation; 13] submitted; 14] unfinished; 15] withdrawn
(certainty) signifies the degree of certainty associated with the intervention or interpretation.
high
medium
low
unknown
(responsible party) indicates the agency responsible for the intervention or interpretation, for example an
editor or transcriber.
indicates the nature of the evidence supporting the reliability or accuracy of the
intervention or interpretation.
Suggested values include: 1] internal; 2] external; 3] conjecture
internal
there is internal evidence to support the intervention.
external
there is external evidence to support the intervention.
conjecture
the intervention or interpretation has been made by the editor, cataloguer, or
scholar on the basis of their expertise.
internal
there is internal evidence to support the intervention.
external
there is external evidence to support the intervention.
conjecture
the intervention or interpretation has been made by the editor, cataloguer, or
scholar on the basis of their expertise.
contains a list of one or more pointers indicating sources
supporting the given intervention or interpretation.
(identifier) provides a unique identifier for the element bearing the attribute.
(number) gives a number (or other label) for an element, which is not necessarily unique within
the document.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
(language) indicates the language of the element content using a tag generated
according to BCP 47
signals an intention about how white space should be
managed by applications.
default
the processor should treat white space according to the
default XML white space handling rules
preserve
the processor should preserve unchanged any and all
white space in the source
gives a name or other identifier for the scribe
believed to be responsible for this hand.
points to a full description of the scribe concerned, typically supplied by a person element
elsewhere in the description.
characterizes the particular script or writing style used by
this hand, for example secretary, copperplate, Chancery, Italian, etc.
points to a full description of the script or writing style used by
this hand, typically supplied by a scriptNote element
elsewhere in the description.
describes the tint or type of ink, e.g. brown, or other
writing medium, e.g. pencil
specifies how widely this hand is used in the manuscript.
sole
only this hand is used throughout the manuscript
major
this hand is used through most of the manuscript
minor
this hand is used occasionally in the manuscript
(MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type
(\p{L}|\p{N}|\p{P}|\p{S})+
may be used to specify further information about the entity referenced by
this name, for example the occupation of a person, or the status of a place.
(reference to the canonical name) provides a means of locating the canonical form
(nym) of the names associated with the object
named by
the element bearing it.
Suggested values include: 1] below; 2] bottom; 3] margin; 4] top; 5] opposite; 6] overleaf; 7] above; 8] end; 9] inline; 10] inspace
below
below the line
bottom
at the foot of the page
margin
in the margin (left, right, or both)
top
at the top of the page
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
below
below the line
bottom
at the foot of the page
margin
in the margin (left, right, or both)
top
at the top of the page
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
characterizes the element in some sense, using any convenient
classification scheme or typology.
provides a sub-categorization of the element, if needed
specifies the destination of the reference by supplying one or more URI References
specifies the intended meaning when the target of a
pointer is itself a pointer.
all
if the element pointed to is itself a pointer, then
the target of that pointer will be taken, and so on, until
an element is found which is not a pointer.
one
if the element pointed to is itself a pointer, then
its target (whether a pointer or not) is taken as the target
of this pointer.
none
no further evaluation of targets is carried out
beyond that needed to find the element specified in the
pointer's target.
characterizes the function of the segment.
specifies whether or not the segment is fragmented by some other
structural element, for example a clause which is divided between two
or more sentences.
Y
(yes) the segment is incomplete in some respect
N
(no) either the segment is complete, or no claim is made as to
its completeness
I
(initial) the initial part of an incomplete segment
M
(medial) a medial part of an incomplete segment
F
(final) the final part of an incomplete segment
(edition) supplies an arbitrary identifier for the source edition in which
the associated feature (for example, a page, column, or line
break) occurs at this point in the text.
indicates the end of a span initiated by the element
bearing this attribute.
indicates the kind of information held in this cell or
in each cell of this row.
Suggested values include: 1] label; 2] data
label
labelling or descriptive information only.
data
data values.
indicates the number of rows occupied by this cell or row.
(columns) indicates the number of columns occupied by this cell or
row.
signifies the hand of the agent which made the intervention.
indicates the effect of the intervention, for example in
the case of a deletion, strikeouts
which include too much or too little text, or in the case of an
addition, an insertion which duplicates some of the text
already present.
Sample values include: 1] duplicate; 2] duplicate-partial; 3] excessStart; 4] excessEnd; 5] shortStart; 6] shortEnd; 7] partial; 8] unremarkable
(sequence) assigns a sequence number related to the order in which
the encoded features carrying this attribute are believed to have occurred.
indicates whether the name component is given in full, as an
abbreviation or simply as an initial.
yes
the name component is spelled out in full.
abb
(abbreviated) the name component is given in an abbreviated form.
init
(initial letter) the name component is indicated only by
one initial.
specifies the sort order of the name component in relation
to others within the personal name.
(paragraph) marks paragraphs in prose.
(foreign) identifies a word or phrase as belonging to some language other than that of the
surrounding text.
(emphasized) marks words or phrases which are stressed or emphasized for
linguistic or rhetorical effect.
(highlighted) marks a word or phrase as graphically distinct from the
surrounding text, for reasons concerning which no claim is
made.
(quotation) contains a phrase or passage attributed by the narrator or author to some agency external
to the text.
(separated from the surrounding text with quotation marks) contains material which is marked as (ostensibly) being somehow different than the
surrounding text, for any one of a variety of reasons including, but not limited to: direct
speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and
passages that are mentioned but not used.
may be used to indicate whether the offset passage is spoken or thought, or to
characterize it more finely.
Suggested values include: 1] spoken; 2] thought; 3] written; 4] soCalled; 5] foreign(foreign words) ; 6] distinct(linguistically distinct) ; 7] term(technical term) ; 8] emph(rhetorically emphasized) ; 9] mentioned
spoken
representation of speech
thought
representation of thought, e.g. internal monologue
written
quotation from a written source
soCalled
authorial distance
foreign
(foreign words)
distinct
(linguistically distinct)
term
(technical term)
emph
(rhetorically emphasized)
mentioned
refering to itself, not its normal referant
(cited quotation) contains a quotation from some other document, together with a bibliographic reference to
its source. In a dictionary it may contain an example text with at least one occurrence of the
word form, used in the sense being described, or a translation of the headword, or an example.
contains a single-word, multi-word, or symbolic designation which is regarded as a technical
term.
specifies the destination of the reference by supplying one or more URI References
(latin for thus or so
) contains text reproduced although apparently incorrect or inaccurate.
(correction) contains the correct form of a passage apparently erroneous in the copy text.
groups a number of alternative encodings for the same point in
a text.
(regularization) contains a reading which has been regularized or normalized in some sense.
(original form) contains a reading which is marked as following the original, rather than being normalized
or corrected.
(gap) indicates a point where material has been omitted in a transcription, whether for editorial
reasons described in the TEI header, as part of sampling practice, or because the material is
illegible, invisible, or inaudible.
gives the reason for omission. Sample values include sampling,
inaudible, irrelevant, cancelled.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
in the case of text omitted from the transcription because of deliberate deletion by an
identifiable hand, signifies the hand which made the deletion.
In the case of text omitted because of damage, categorizes the cause of the damage, if
it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
contains a word, phrase, or passage which cannot be transcribed with certainty because it
is illegible or inaudible in the source.
indicates why the material is hard to transcribe.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
Where the difficulty in transcription arises from action (partial deletion, etc.)
assignable to an identifiable hand, signifies the hand responsible for the action.
Where the difficulty in transcription arises from damage, categorizes the cause of
the damage, if it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
(referencing string) contains a general purpose name or referring string.
(You must indicate the type of the referring string) indicates more specifically the object referred to by the referencing string.
Values might include person, place, ship,
element etc.
person
(natural person)
figure
(literary figure)
geographic-entity
(self explanatory)
organization
(any type of organization)
other
contains a postal address, for example of a
publisher, an organization, or an individual.
(address line) contains one line of a postal address.
contains a date in any format.
(You must enter a date in ISO format) supplies the value of the date or time in a standard form,
e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
(You must enter a date in ISO format) specifies the earliest possible date for the event in
standard form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
(You must enter a date in ISO format) specifies the latest possible date for the event in
standard form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
(You must enter a date in ISO format) indicates the starting point of the period in standard form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
(You must enter a date in ISO format) indicates the ending point of the period in standard
form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
contains a phrase defining a time of day in any format.
(abbreviation) contains an abbreviation of any sort.
allows the encoder to classify the abbreviation according to some convenient
typology.
Sample values include: 1] suspension; 2] contraction; 3] brevigraph; 4] superscription; 5] acronym; 6] title; 7] organization; 8] geographic
(expansion) contains the expansion of an abbreviation.
(takes the function of ref and ptr. Can therefore also be empty) defines a reference to another location, possibly modified by additional text or comment.
Only one of the
attributes 'target' and 'cRef' may be supplied.
(The target of this pointer / reference) specifies the destination of the reference by supplying one or more URI References
(list) contains any sequence of items organized as a list.
describes the form of the list.
Suggested values include: 1] ordered; 2] bulleted; 3] simple; 4] gloss
ordered
list items are numbered or lettered.
bulleted
list items are marked with a bullet or other typographic device.
simple
list items are not numbered or bulleted.
gloss
each list item glosses some term or concept, which is given by a label element
preceding the list item.
contains one component of a list.
contains the label associated with an item in a list; in glossaries, marks the term being
defined.
(heading) contains any type of heading, for example the title of a section, or the heading of a list,
glossary, manuscript description, etc.
contains a note or annotation.
indicates whether the copy text shows the exact place of reference for the note.
(The place of the note should be specified
whenever the note refers to a note present in the
original text. The current text is a simplified version
of the P5 one)
margin
other
app
foot
end
inline
interlinear
margin
other
app
foot
end
inline
interlinear
specifies the destination of the reference by supplying one or more URI References
(responsible party) indicates the agency responsible for the intervention or interpretation, for example an
editor or transcriber.
(The place of the note must be specified. The
current text is a simplified version of the P5
one) characterizes the element in some sense, using any convenient
classification scheme or typology.
annotation
gloss
citation
other
(index entry) marks a location to be indexed for whatever purpose.
(You must indicate the type of this index) supplies a name to specify which index (of several) the index entry belongs to.
name
place
subject
citation
other
indicates the location of an inline graphic, illustration, or figure.
(The notation used to express this formula) A URL which refers to the image itself.
marks a boundary point separating any kind of section of a text, typically but not
necessarily indicating a point at which some part of a standard reference system changes, where
the change is not represented by a structural element.
(You must indicate the unit that this milestone
marks. The proposed values have been taken from the P5
documentation) provides a conventional name for the kind of section changing at this milestone.
book
poem
canto
stanza
act
scene
section
absent
(page break) marks the boundary between one page of a text and the next in a standard reference system.
(line break) marks the start of a new (typographic) line in some edition or version of a text.
(column break) marks the boundary between one column of a text and the next
in a standard reference system.
(monographic level) contains bibliographic elements describing an item (e.g. a book or journal) published as an
independent item (i.e. as a separate physical object).
in a bibliographic reference, contains the name(s) of the
author(s), personal or corporate, of a work; for example in the same
form as that provided by a recognized bibliographic name authority.
secondary statement of responsibility for a bibliographic item, for example the name of an
individual, institution or organization, (or of several such) acting as editor, compiler,
translator, etc.
contains a title for any kind of work.
indicates the bibliographic level for a title, that is, whether
it identifies an article, book, journal, series, or
unpublished material.
a
(analytic) analytic title (article, poem, or other item
published as part of a larger item)
m
(monographic) monographic title (book, collection, or
other item published as a distinct item,
including single volumes of multi-volume
works)
j
(journal) journal title
s
(series) series title
u
(unpublished) title of unpublished material (including
theses and dissertations unless
published by a commercial press)
classifies the title according to some convenient typology.
Sample values include: 1] main; 2] sub(subordinate) ; 3] alt(alternate) ; 4] short; 5] desc(descriptive)
groups information relating to the publication or distribution
of a bibliographic item.
provides the name of the organization responsible for the publication or distribution of a
bibliographic item.
(publication place) contains the name of the place where a bibliographic item was published.
(bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may
not be explicitly tagged.
(structured bibliographic citation) contains a structured bibliographic citation, in which only bibliographic sub-elements
appear and in a specified order.
(verse line) contains a single, possibly incomplete, line of verse.
specifies whether or not the line is metrically complete.
Y
(yes) the line is metrically incomplete
N
(no) either the line is complete, or no claim is made as to its completeness
I
(initial) the initial part of an incomplete line
M
(medial) a medial part of an incomplete line
F
(final) the final part of an incomplete line
(line group) contains a group of verse lines functioning as a formal unit, e.g. a stanza, refrain,
verse paragraph, etc.
(speech) An individual speech in a performance text, or a passage presented as such in a prose or
verse text.
A specialized form of heading or label, giving the name of one or more speakers in a
dramatic text or fragment.
(stage direction) contains any kind of stage direction within a dramatic text or fragment.
indicates the kind of stage direction.
Suggested values include: 1] setting; 2] entrance; 3] exit; 4] business; 5] novelistic; 6] delivery; 7] modifier; 8] location; 9] mixed
setting
describes a setting.
entrance
describes an entrance.
exit
describes an exit.
business
describes stage business.
novelistic
is a narrative, motivating stage direction.
delivery
describes how a character speaks.
modifier
gives some detail about a character.
location
describes a location.
mixed
more than one of the above
indicates type of entry, in dictionaries with multiple types.
Suggested values include: 1] main; 2] hom(homograph) ; 3] xref(cross reference) ; 4] affix; 5] abbr(abbreviation) ; 6] supplemental; 7] foreign
main
a main entry (default).
hom
(homograph) groups information relating to one homograph within an entry.
xref
(cross reference) a reduced entry whose only function is to point to another main
entry (e.g. for forms of an irregular verb or for variant spellings:
was pointing to be, or
esthete to aesthete).
affix
an entry for a prefix, infix, or suffix.
abbr
(abbreviation) an entry for an abbreviation.
supplemental
a supplemental entry (for use in dictionaries which issue supplements to their main
work in which they include updated information about entries).
foreign
an entry for a foreign word in a monolingual dictionary.
contains a (sortable) character sequence reflecting the entry's alphabetical position in
the printed dictionary.
(\p{L}|\p{N}|\p{P}|\p{S})+
gives an expanded form of information presented more concisely in the dictionary
(normalized) gives a normalized form of information given by the source text in a
non-normalized form
gives the list of split values for a merged form
gives a value which lacks any realization in the printed source text.
(original) gives the original string or is the empty string when the element does not appear
in the source text.
provides a reference to an anchor element elsewhere in the document
indicating the original location of this component.
gives a reference to another element, where the original appears as a merged form.
(optional) indicates whether the element is optional or not
groups successive entries for a set of homographs.
contains a reasonably well-structured dictionary entry.
groups together all information relating to one word sense in a dictionary entry, for
example definitions, examples, and translation equivalents.
gives the nesting depth of this sense.
(\-?[\d]+/\-?[\d]+)
(form information group) groups all the information on the written and spoken forms of one headword.
classifies form as lemma (line), headword or simple.
lemma
(lemma, containing the headword)
headword
(the headword itself)
simple
(single free lexical item)
(gender) identifies the morphological gender of a lexical item, as given in the dictionary.
(grammatical information group) groups morpho-syntactic information about a lexical item, e.g. pos, gen, number, case, or
iType (inflectional class).
(part of speech) indicates the part of speech assigned to a dictionary
headword such as noun, verb, or adjective.
(subcategorization) contains subcategorization information (transitive/intransitive, countable/non-countable,
etc.)
(definition) contains definition text in a dictionary entry.
(etymology) encloses the etymological information in a dictionary entry.
(usage) contains usage information in a dictionary entry.
classifies the usage information using any convenient typology.
Sample values include: 1] geo(geographic) ; 2] time; 3] dom(domain) ; 4] reg(register) ; 5] style; 6] plev(preference level) ; 7] lang(language) ; 8] gram(grammatical) ; 9] syn(synonym) ; 10] hyper(hypernym) ; 11] colloc(collocation) ; 12] comp(complement) ; 13] obj(object) ; 14] subj(subject) ; 15] verb; 16] hint
(cross-reference phrase) contains a phrase, sentence, or icon referring the reader to some other location in this or
another text.
indicates the type of cross reference, using any convenient typology.
Sample values include: 1] syn(synonym) ; 2] etym(etymological) ; 3] cf(compare or consult) ; 4] illus(illustration)
(related entry) contains a dictionary entry for a lexical item related to the headword, such as a compound
phrase or derived form, embedded inside a larger entry.
(setting) contains a description of the setting, time, locale, appearance, etc., of the action of a
play, typically found in the front matter of a printed performance text (not a stage direction).
contains the prologue to a drama, typically spoken by an actor out of character, possibly in
association with a particular performance or venue.
contains the epilogue to a drama, typically spoken by an actor out of character, possibly in
association with a particular performance or venue.
contains a section of front or back matter describing how a dramatic piece is to be
performed in general or how it was performed on some specific occasion.
(cast list) contains a single cast list or dramatis personae.
(cast list grouping) groups one or more individual castItem
elements within a cast list.
(cast list item) contains a single entry within a cast list, describing
either a single role or a list of non-speaking roles.
characterizes the cast item.
role
the item describes a single role.
list
the item describes a list of non-speaking roles.
the name of a dramatic role, as given in a cast list.
(role description) describes a character's role in a drama.
Name of an actor appearing within a cast list.
(movement) marks the actual entrance or exit of one or more characters on stage.
characterizes the movement, for example as an entrance or exit.
Suggested values include: 1] entrance; 2] exit; 3] onStage
entrance
character is entering the stage.
exit
character is exiting the stage.
onStage
character moves on stage
specifies the direction of a stage movement.
Sample values include: 1] L(left) ; 2] R(right) ; 3] C(center)
(performance) identifies the performance or performances in which this movement occurred as specified.
describes the visual context of some part of a screen play in
terms of what the spectator sees, generally independent of any
dialogue.
describes a particular camera angle or viewpoint in a screen play.
describes a sound effect or musical sequence specified within a screen play or radio script.
categorizes the sound in some respect, e.g. as music, special effect, etc.
indicates whether the sound overlaps the surrounding speeches or interrupts them.
unknown
inapplicable
contains the text of a caption or other text displayed as part of
a film script or screenplay.
(technical stage direction) describes a special-purpose stage direction that is not
meant for the actors.
categorizes the technical stage direction.
light
a lighting cue
sound
a sound cue
prop
a prop cue
block
a blocking instruction
(performance) identifies the performance or performances to which this
technical direction applies.
contains text displayed in tabular form, in rows and columns.
indicates the number of rows in the table.
(columns) indicates the number of columns in each row of the table.
contains one row of a table.
contains one cell of a table.
contains a mathematical or other formula.
(The notation used to express this formula) supplies the name of a previously defined notation used for the content of the element.
groups elements representing or containing graphic information such as an illustration or
figure.
(description of figure) contains a brief prose description of the appearance or content
of a graphic figure, for use when documenting an image without
displaying it.
(character or glyph) represents a non-standard character or glyph.
points to a description of the character or glyph intended.
(character) provides descriptive information about a character.
(character name) contains the name of a character, expressed following Unicode conventions.
(character property) provides a name and value for some property of the parent character or glyph.
(character declarations) provides information about nonstandard characters and glyphs.
(character glyph) provides descriptive information about a character glyph.
(character glyph name) contains the name of a glyph, expressed following Unicode conventions for character names.
(locally-defined property name) contains a locally defined name for some property.
(character mapping) contains one or more
characters which are related to the parent character or glyph
in some respect, as specified by the type
attribute.
(unicode property name) contains the name of a registered Unicode normative or informative property.
specifies the version number of the Unicode Standard in which this property name is
defined.
[\d]+(\.[\d]+){0,2}
(value) contains a single value for some property, attribute, or other
analysis.
(TEI Header) supplies the descriptive and declarative information making up an electronic title page
prefixed to every TEI-conformant text.
specifies the kind of document to which the header is attached, for example whether it
is a corpus or individual text.
Sample values include: 1] text; 2] corpus
(file description) contains a full bibliographic description of an electronic file.
(title statement) groups information about the title of a work and those responsible for its intellectual
content.
(edition) describes the particularities of one edition of a text.
describes the approximate size of a text as stored on some carrier medium, whether digital
or non-digital, specified in any convenient units.
(publication statement) groups information concerning the publication or distribution of an electronic or other
text.
(release authority) supplies the name of a person or other agency responsible for
making an electronic file available, other than a publisher or
distributor.
(identifier) supplies any form of identifier used to identify some object,
such as a bibliographic item, a person, a title, an organization,
etc. in a standardized way.
categorizes the identifier, for example as an ISBN, Social
Security number, etc.
supplies information about the availability of a text, for example any restrictions on its
use or distribution, its copyright status, etc.
supplies a code identifying the current availability of the text.
free
the text is freely available.
unknown
the status of the text is unknown.
restricted
the text is not freely available.
(series statement) groups information about the series, if any, to which a publication belongs.
(notes statement) collects together any notes providing information about a text additional to that recorded
in other parts of the bibliographic description.
(source description) describes the source from which an electronic text was derived or generated, typically a
bibliographic description in the case of a digitized text, or a phrase such as "born digital"
for a text which has no previous existence.
(fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file
description are present.
(encoding description) documents the relationship between an electronic text and the
source or sources from which it was derived.
(editorial practice declaration) provides details of editorial principles and practices applied
during the encoding of a text.
indicates the extent of normalization or regularization of the original source carried out
in converting it to electronic form.
indicates the authority for any normalization carried out.
indicates the method adopted to indicate normalizations within the text.
silent
normalization made silently
markup
normalization represented using markup
(text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the
languages and sublanguages used, the situation in which it was produced, the participants and
their setting.
(note on hand) describes a particular style or hand distinguished within a manuscript.
(revision description) summarizes the revision history for a file.
summarizes a particular change or correction made to a particular version of an electronic
text which is shared between several researchers.
describes a particular font or other significant typographic feature distinguished within
the description of a printed resource.
describes a particular script distinguished within
the description of a manuscript or similar resource.
(apparatus entry) contains one entry in a critical apparatus, with an optional
lemma and at least one reading.
Only one <lem> element may appear within a single
apparatus entry, whether it appears outside a <rdgGrp>
element or within it.
classifies the variation contained in this element according to
some convenient typology.
identifies the beginning of the lemma in the base text, if
necessary.
identifies the endpoint of the lemma in the base text, if
necessary.
(location) indicates the location of the variation, when the
location-referenced method of apparatus markup is used.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
(lemma) contains the lemma, or base text, of a textual variation.
(reading) contains a single reading within a textual variation.
(reading group) within a textual variation,
groups two or more readings perceived to have a genetic
relationship or other affinity.
(witness detail) gives further information about a particular witness, or
witnesses, to a particular reading.
(witnesses) indicates the sigil or sigla for the witnesses to which the
detail refers.
describes the type of information given about the witness.
contains a list of one or more sigla of witnesses attesting a
given reading, in a textual variation.
(witness list) lists definitions for all the witnesses referred to by a critical
apparatus, optionally grouped hierarchically.
contains either a description of a single witness referred to
within the critical apparatus, or a list of witnesses which is to be
referred to by a single sigil.
(fragmented witness start) indicates the beginning, or resumption, of the text of a
fragmentary witness.
(fragmented witness end) indicates the end, or suspension, of the text of a fragmentary
witness.
indicates the beginning of a lacuna in the text of a mostly
complete textual witness.
indicates the end of a lacuna in a mostly complete textual
witness.
declares the method used to encode text-critical variants.
indicates which method is used to encode the apparatus of
variants.
location-referenced
apparatus uses line numbers or other canonical reference scheme
referenced in a base text.
double-end-point
apparatus indicates the precise locations of the beginning and
ending of each lemma relative to a base text.
parallel-segmentation
alternate readings of a passage are given in parallel in the
text; no notion of a base text is necessary.
indicates whether the apparatus appears within the running text
or external to it.
internal
apparatus appears within the running text.
external
apparatus appears outside the base text.
(witness or witnesses) contains a list of one or more sigla indicating the witnesses
which begin or end at this point.
(witness or witnesses) contains a list of one or more pointers indicating the witnesses
which attest to a given reading.
classifies the reading according to some useful typology.
Sample values include: 1] substantive; 2] orthographic
classifies the cause for the variant reading, according to
any appropriate typology of possible origins.
Sample values include: 1] homeoteleuton; 2] homeoarchy; 3] paleographicConfusion; 4] haplography; 5] dittography; 6] falseEmendation
(variant sequence) provides a number indicating the position of this reading in a
sequence, when there is reason to presume a sequence to the variants
on any one lemma.
signifies the hand responsible for a particular reading in the
witness.
(manuscript description) contains a description of a single identifiable
manuscript or other text-bearing object.
describes the system used to ensure
correct ordering of the quires making up a codex or incunable,
typically by means of annotations at the foot of the page.
contains a dimensional specification.
Width element may appear once only
Height element may appear once only
Depth element may appear once only
indicates which aspect of the object is being measured.
Sample values include: 1] leaves; 2] ruled; 3] pricked; 4] written; 5] miniatures; 6] binding; 7] box
contains any single measurement forming part of a dimensional
specification of some sort.
contains a measurement measured along the
axis at right angles to the bottom of the written surface,
i.e. parallel to the spine for a codex or book.
contains a measurement measured across the
spine of a book or codex, or (for other text-bearing objects) perpendicular to the measurement given by the
width element.
contains a measurement measured along the
axis parallel to the bottom of the written surface, i.e.
perpendicular to the spine of a book or codex.
contains a heraldic formula
or phrase, typically found as part of a blazon, coat of arms, etc.
defines a location within a manuscript or manuscript part, usually as a
(possibly discontinuous) sequence of folio references.
identifies the foliation scheme in terms of which the location is being
specified.
specifies the starting point of the location in a normalized form.
(\p{L}|\p{N}|\p{P}|\p{S})+
specifies the end-point of the location in a normalized form.
(\p{L}|\p{N}|\p{P}|\p{S})+
groups a number of locations which together form a
distinct but discontinuous item within a manuscript or manuscript
part, according to a specific foliation.
identifies the foliation scheme in terms of which all the
locations contained by the group are specified.
contains a word or phrase describing the
material of which the object being described is composed.
contains a word or phrase describing the type of object being refered to.
(origin date) contains any form of date, used to
identify the date of origin for a manuscript or manuscript part.
(origin place) contains any form of place name, used to identify the
place of origin for a manuscript or manuscript part.
(second folio) The word or words taken from a fixed point
in a codex (typically the beginning of the
second leaf) in order
to provide a unique identifier for it.
contains discussion of the leaf or quire signatures found within a codex.
contains a word or phrase describing a
stamp or similar device.
contains a word or phrase describing a
watermark or similar device.
(manuscript identifier) contains the information required to identify
the manuscript being described.
You must supply either a locator of some type or a
name
contains the name of an organization such as a university or
library, with which a manuscript is identified, generally its
holding institution.
contains the name of a repository within which manuscripts are stored, possibly forming part of an institution.
contains the name of a collection of
manuscripts, not necessarily located within a single repository.
(alternative identifier) contains an alternative or former structured identifier used for
a manuscript, such as a former catalogue number.
(alternative name) contains any form of unstructured alternative
name used for a manuscript, such as an ocellus
nominum, or nickname.
contains the colophon
of a manuscript item: that is, a statement providing information regarding the date, place, agency, or reason for production of the manuscript.
contains the explicit of a
manuscript item, that is, the closing words of the text proper,
exclusive of any rubric or colophon which might follow it.
contains information concerning the manuscript's filiation, i.e. its relationship to other surviving manuscripts of the same text, its protographs, antigraphs and apographs.
contains the string of words that denotes the end of a text division, often with an assertion as to its author and title, usually set off from the text itself by red ink, by a different size or type of script, or by some other such visual device.
contains the incipit of a manuscript item, that is the opening words of the text proper, exclusive of any rubric which might precede it, of sufficient length to identify the work uniquely; such incipts were, in fomer times, frequently used a means of reference to a work, in place of a title.
(manuscript contents) describes the intellectual content of a manuscript or manuscript
part, either as a series of paragraphs or as a series of structured manuscript items.
identifies the text types or classifications applicable to this object.
(manuscript item) describes an individual work or item within the intellectual
content of a manuscript or manuscript part.
identifies the text types or classifications applicable to this
item
(structured manuscript item) contains a structured description for an
individual work or item within the intellectual content of a
manuscript or manuscript part.
identifies the text types or classifications applicable to this
item
contains the text of any rubric or heading attached to a particular manuscript item, that is, a string of words through which a
manuscript signals the beginning of a text division, often with an assertion as to its author and title, which is in some way set off from the text itself, usually in red ink, or by use of different size or type of script, or some other such visual device.
contains an overview of the available
information concerning some aspect of an item (for example, its
intellectual content, history, layout, typography etc.) as a
complement or alternative to the more detailed information carried by
more specific elements.
(text language) in a manuscript description, describes
the languages and writing systems identified within the manuscript
being described.
(main language) supplies a code which identifies the chief language used in the manuscript.
(other languages) one or more codes identifying any other languages used in the manuscript.
indicates whether the passage being quoted is defective,
i.e. incomplete through loss or damage.
unknown
inapplicable
(physical description) contains a full physical description of a
manuscript or manuscript part, optionally subdivided using more
specialised elements from the model.physDescPart class.
contains a description of the physical
components making up the object which is being described.
a short project-specific name identifying the physical form of
the carrier, for example as a codex, roll, fragment, partial leaf,
cutting etc.
(support description) groups elements describing the physical support for the written part of a manuscript.
a short project-defined name for the material composing
the majority of the support
Suggested values include: 1] paper (paper) ; 2] parch(parchment) ; 3] mixed
paper
(paper)
parch
(parchment)
mixed
contains a description of the materials
etc. which make up the physical support for the written part of a manuscript.
contains a description of how the leaves or bifolia are physically
arranged.
describes the numbering system or systems used to
count the leaves or pages in a codex.
contains a description of the physical
condition of the manuscript.
(layout description) collects the set of layout descriptions applicable to a manuscript.
describes how text is laid out on the
page, including information about any ruling, pricking, or other
evidence of page-preparation techniques.
specifies the number of columns per page
specifies the number of ruled lines per column
specifies the number of written lines per
column
(description of hands) contains a description of all the different kinds of writing used in a manuscript.
specifies the number of distinct hands identified within the manuscript
contains a description of the typefaces or other aspects of
the printing of an incunable or other printed source.
contains a description of the scripts used in a manuscript or similar source.
contains description of type of musical notation.
(decoration description) contains a description of the decoration of a manuscript, either as a sequence of paragraphs, or as a sequence of topically organised decoNote elements.
(note on decoration) contains a note describing either a
decorative component of a manuscript, or a fairly homogenous class of
such components.
contains a description of any significant additions found
within a manuscript, such as marginalia or other annotations.
(binding description) describes the present and former bindings of a manuscript, either
as a series of paragraphs or as a series of distinct binding elements,
one for each binding of the manuscript.
contains a description of one binding, i.e. type of covering, boards,
etc. applied to a manuscript.
specifies whether or not the binding is contemporary with the majority of its
contents
unknown
inapplicable
(seal description) describes the seals or other external items attached to a manuscript, either
as a series of paragraphs or as a series of distinct seal elements,
possibly with additional decoNotes.
contains a description of one seal or similar
attachment applied to a manuscript.
specifies whether or not the seal is contemporary with the
item to which it is affixed
unknown
inapplicable
(accompanying material) contains details of any significant additional
material which may be closely associated with the manuscript being
described, such as non-contemporaneous documents or fragments bound in
with the manuscript at some earlier historical period.
groups elements
describing the full history of a manuscript or manuscript part.
contains any descriptive or other information
concerning the origin of a manuscript or manuscript part.
contains any descriptive or other information
concerning a single identifiable episode during the history of a manuscript
or manuscript part, after its creation but before its acquisition.
contains any descriptive or other information
concerning the process by which a manuscript or manuscript part entered the holding
institution.
groups additional information, combining
bibliographic information about a manuscript, or surrogate copies of
it with curatorial or administrative information.
(administrative information) contains information about the present
custody and availability of the manuscript, and also about the record
description itself.
(recorded history) provides information about the source and
revision status of the parent manuscript description itself.
describes the original source for the information contained with a manuscript description.
(custodial history) contains a description of a manuscript's custodial history, either
as running prose or as a series of dated custodial events.
(custodial event) describes a single event during the custodial history of a manuscript.
contains information about any representations of the manuscript being described which
may exist in the holding institution or elsewhere.
(manuscript part) contains information about an originally
distinct manuscript or part of a manuscript, now forming part of a composite manuscript.
(TEI document) contains a single TEI-conformant document,
comprising a TEI header and a text, either in isolation or as part of a
teiCorpus element.
specifies the version number of the TEI Guidelines against
which this document is valid.
[\d]+(\.[\d]+){0,2}
contains a single text of any kind, whether unitary or composite, for example a poem or
drama, a collection of essays, a novel, a dictionary, or a corpus sample.
(text body) contains the whole body of a single unitary text, excluding any front or back matter.
contains the body of a composite text, grouping together a sequence of distinct texts (or
groups of such texts) which are regarded as a unit for some purpose, for example the collected
works of an author, a sequence of prose essays, etc.
(text division) contains a subdivision of the front, body, or back of a text.
contains a closing title or footer appearing at the end of a division of a text.
contains the primary statement of responsibility given for a work
on its title page or at the head or end of the work.
contains a brief description of the place, date, time, etc. of production of a letter,
newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer.
contains a quotation, anonymous or attributed, appearing at the start of a section or
chapter, or on a title page.
groups together dateline, byline, salutation, and similar phrases appearing as a preliminary
group at the start of a division, especially of a letter.
groups together salutations, datelines, and similar phrases appearing as a final group at
the end of a division, especially of a letter.
(salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other
division of a text, or the salutation in the closing of a letter, preface, etc.
(signature) contains the closing salutation, etc., appended to a foreword,
dedicatory epistle, or other division of a text.
contains a postscript, e.g. to a letter.
(title page) contains the title page of a text, appearing within the front or back matter.
classifies the title page according to any convenient typology.
(document title) contains the title of a document, including all its
constituents, as given on a title page.
contains a subsection or division of the title of a work, as
indicated on a title page.
specifies the role of this subdivision of the title.
Suggested values include: 1] main; 2] sub(subordinate) ; 3] alt(alternate) ; 4] short; 5] desc(descriptive)
main
main title of the work
sub
(subordinate) subtitle of the work
alt
(alternate) alternative title of the work
short
abbreviated form of title
desc
(descriptive) descriptive paraphrase of the work
(document author) contains the name of the author of the document, as given on the
title page (often but not always contained in a byline).
contains a formal statement authorizing the publication of a work, sometimes required to
appear on a title page or its verso.
(document edition) contains an edition statement as presented on a title page of a
document.
(document imprint) contains the imprint statement (place and date of publication,
publisher name), as given
(usually) at the foot of a title page.
(document date) contains the date of a document, as given
(usually) on a title page.
gives the value of the date in standard form, i.e. YYYY-MM-DD.
(front matter) contains any prefatory matter (headers,
title page, prefaces, dedications, etc.)
found at the start of a document, before the main body.
(back matter) contains any appendixes, etc. following the main part of a text.
(metrical structure, conventional) contains a user-specified encoding for the conventional
metrical structure of the element.
(metrical structure, realized) contains a user-specified encoding for the actual realization
of the conventional metrical structure applicable to the element.
(rhyme scheme) specifies the rhyme scheme applicable to a group of verse lines.
(enjambement) indicates that the end of a verse line is marked by enjambement.
Sample values include: 1] no; 2] yes; 3] weak; 4] strong
marks the point at which a metrical line may be divided.
marks the rhyming part of a metrical line.
provides a label to identify which part of a rhyme scheme this rhyming string
instantiates.
(\p{L}|\p{N}|\p{P}|\p{S})+
(anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element.
(identifier) provides a unique identifier for the element bearing the attribute.
(identifier) provides a unique identifier for the element bearing the attribute.
(corresponds) points to elements that correspond to the current
element in some way.
(synchronous) points to elements that are synchronous with the current
element.
points to an element that is the same as the current
element.
points to an element of which the current element is a
copy.
points to the next element of a virtual aggregate of which
the current element is part.
(previous) points to the previous element of a virtual aggregate of
which the current element is part.
points to elements that are in exclusive alternation
with the current element.
selects one or more alternants; if one alternant is
selected, the ambiguity or uncertainty is marked as resolved. If
more than one alternant is selected, the degree of ambiguity or
uncertainty is marked as reduced by the number of alternants not
selected.
(word) represents a grammatical (not necessarily orthographic) word.
provides a lemma for the word, such as an uninflected
dictionary entry form.
provides a pointer to a definition of the lemma for the
word, for example in an online lexicon.
(character) represents a character.
(punctuation character) a character or string of characters regarded as constituting a
single punctuation mark.
indicates the extent to which this punctuation mark
conventionally separates words or phrases
strong
the punctuation mark is a word separator
weak
the punctuation mark is not a word separator
inter
the punctuation mark may or may not be a
word separator
provides a name for the kind of unit delimited by this punctuation mark.
indicates whether this punctuation mark precedes or
follows the unit it delimits.
(analysis) indicates one or more elements containing interpretations of the
element on which the ana attribute appears.
(facsimile) points to all or part of an image which corresponds with the content of the element.
(added span of text) marks the beginning of a longer sequence of text added by an
author, scribe, annotator or corrector (see also add).
The spanTo= attribute of is required.
L'attribut spanTo est requis.
(deleted span of text) marks the beginning of a longer sequence of text deleted,
marked as deleted, or otherwise signaled as superfluous or spurious by an
author, scribe, annotator, or corrector.
The spanTo= attribute of
is required.
L'attribut spanTo est requis.
(gathering begins) marks the point in a transcribed codex at which a new
gathering or quire begins.
contains one or more handNote elements documenting the
different hands identified within the source texts.
(Texte superflu) marks text present in the source which the editor believes to
be superfluous or redundant.
indicates the grounds for believing this text to be superfluous.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
(feature structure) represents a feature structure, that is, a
collection of feature-value pairs organized as a
structural unit.
specifies the type of the feature structure.
(features) references the feature-value specifications making up this feature structure.
(feature) represents a feature value specification, that
is, the association of a name with a value of any of several different types.
A feature value cannot
contain both text and element content
A feature value can contain
only one child element
provides a name for the feature.
(feature value) references any element which can be used to represent the
value of a feature.
(symbolic value) represents the value part of a feature-value specification
which contains one of a finite list of symbols.
supplies the symbolic value for the feature, one of a finite list that
may be specified in a feature declaration.
(\p{L}|\p{N}|\p{P}|\p{S})+
(feature library) assembles a library of feature elements.
(organization name) contains an organizational name.
(personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including any
or all of the person's forenames, surnames, honorifics, added names, etc.
contains a family (inherited) name, as opposed to a given, baptismal, or nick name.
contains a forename, given or baptismal name.
(generational name component) contains a name component used to distinguish otherwise similar names on the basis of the relative ages or generations of the persons
named.
(name link) contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or
of.
(additional name) contains an additional name component, such as a nickname, epithet, or alias, or any other descriptive phrase used within a personal
name.
contains a name component which indicates that the referent has a particular role or position in society, such as an official title or
rank.
(country) contains the name of a geo-political unit, such as a nation, country, colony, or
commonwealth, larger than or administratively superior to a region and smaller than a bloc.
that part of a relative temporal or spatial expression which indicates the direction of the offset between the two place names, dates, or
times involved in the expression.
(geographical feature name) contains a common noun identifying some geographical feature contained within a geographic
name, such as valley, mount, etc.
(affiliation) contains an informal description of a person's present or past affiliation with some
organization, for example an employer or sponsor.
(age) specifies the age of a person.
supplies a numeric code representing the age or age group
(birth) contains information about a person's birth, such as its date and place.
(climate) contains information about the physical climate of a place.
(death) contains information about a person's death, such as its date and place.
contains a description of the educational experience of a person.
(event) contains data relating to any kind of significant event associated with a person, place, or organization.
indicates the location of an event by pointing to a place element
specifies the faith, religion, or belief set of a person.
contains information about a person's period of activity.
(geographical coordinates) contains any expression of a set of geographic coordinates, representing a point, line, or area on the surface of the earth in some
notation.
(language knowledge) summarizes the state of a person's linguistic knowledge, either as prose or by a list of langKnown elements.
supplies one or more valid language tags for the languages specified
(language known) summarizes the state of a person's linguistic competence, i.e., knowledge of a single language.
supplies a valid language tag for the language concerned.
a code indicating the person's level of knowledge for this language
(\p{L}|\p{N}|\p{P}|\p{S})+
(list of organizations) contains a list of elements, each of which provides information about an identifiable
organization.
(list of events) contains a list of descriptions, each of which provides information
about an identifiable event.
(list of persons) contains a list of descriptions, each of which provides information about an identifiable
person or a group of people, for example the participants in a language interaction, or the
people referred to in a historical source.
(list of places) contains a list of places, optionally followed by a list of relationships (other than
containment) defined amongst them.
defines the location of a place as a set of geographical coordinates, in terms of a other named geo-political entities, or as an
address.
contains an informal description of a person's present or past nationality or citizenship.
contains an informal description of a person's trade, profession or occupation.
identifies the classification system or taxonomy in use by supplying the identifier of a taxonomy element elsewhere in the
header.
identifies an occupation code defined within the classification system or taxonomy defined by the scheme
attribute.
(organization) provides information about an identifiable organization such as a business, a tribe, or
any other grouping of people.
specifies a primary role or classification for the organization.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
(relation group) provides information about relationships identified amongst people, places, and
organizations, either informally as prose or as formally expressed relation links.
provides information about an identifiable individual, for example a participant in a language interaction, or a person referred to in a
historical source.
specifies a primary role or classification for the person.
specifies the sex of the person.
0
1
2
9
specifies an age group for the person.
(personal group) describes a group of individuals treated as a single person for analytic purposes.
specifies the role of this group of participants in the interaction.
specifies the sex of the participant group.
0
1
2
9
mixed
specifies the age group of the participants.
specifies the size or approximate size of the group.
(\p{L}|\p{N}|\p{P}|\p{S})+
(\p{L}|\p{N}|\p{P}|\p{S})+
contains data about a geographic location
contains information about the population of a place.
(relationship) describes any kind of relationship or linkage amongst a specified group of participants.
Only one of the attributes
'active' and 'mutual' may be supplied
the attribute 'passive'
may be supplied only if the attribute 'active' is
supplied
categorizes the relationship in some respect, e.g. as social, personal or other.
Suggested values include: 1] social; 2] personal; 3] other
social
relationship concerned with social roles
personal
relationship concerned with personal roles, e.g. kinship, marriage, etc.
other
other kinds of relationship
supplies a name for the kind of relationship of which this is an instance.
identifies the passive participants in a non-mutual relationship.
identifies the active participants in a non-mutual relationship, or all the participants in a mutual
one.
supplies a list of participants amongst all of whom the relationship holds equally.
(residence) describes a person's present or past places of residence.
specifies the sex of a person.
0
1
2
9
(socio-economic status) contains an informal description of a person's perceived social or economic status.
identifies the classification system or taxonomy in use.
identifies a status code defined within the classification system or taxonomy defined by the source attribute.
contains information about the physical terrain of a place.
contains a description of some culturally-determined and in principle unchanging characteristic attributed to a person or place.
(canonical name) contains the definition for a canonical name or namepart of any kind.
points to constituent nyms
(list of canonical names) contains a list of nyms, that is, standardized names for any thing.
supplies the value of a date or time in a standard form.
[0-9.,DHMPRSTWYZ/:+\-]+
specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd.
[0-9.,DHMPRSTWYZ/:+\-]+
indicates the starting point of the period in standard form.
[0-9.,DHMPRSTWYZ/:+\-]+
indicates the ending point of the period in standard form.
[0-9.,DHMPRSTWYZ/:+\-]+