The TEI Header
According to present international standards, metadata is of particular significance for retrieval,
and it will become even more important in the future. Clear and accurate metadata will also be
critical for the identification of sources on the Internet and questions of long-term archiving.
How can an electronic document be described, "so that the text itself, its source, its encoding,
and its revisions are all thoroughly documented? Such documentation is equally necessary for
scholars using the texts, for software processing them, and for cataloguers in libraries and archives.
Together these descriptions and declarations provide an electronic analogue to the title page
attached to a printed work. Every TEI-conformant text must carry such a set of descriptions. The
set is known as the TEI Header."
For the HAB project we have selected the bibliographical metadata elements for the electronic
documents on the basis of the TEI Guidelines. Usually, a large part of the desired metadata can
be taken from the existing records in the online catalog, both for the electronic document
and for the printed source, provided that the print edition has already been catalogued. This
metadata becomes the hierarchically structured content of the TEI element "TEI Header." A script
developed specifically for this purpose converts the relevant catalog data and inserts it into
TEI-conformant XML documents according to the DTD. The Dublin Core metadata elements for
the encoding of Internet sources can be added during the transformation to HTML.
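The conversion described above can be illustrated with a minimal sketch. It is not the script used in the HAB project, which is not reproduced here. The catalog field names (`title`, `author`, `publisher`, `source`), the function names, and the mapping onto Dublin Core names are assumptions made for illustration only. The sketch builds the smallest header structure the TEI Guidelines require (a file description with title statement, publication statement, and source description) and then derives HTML `meta` elements from it, as might happen at the transformation to HTML.

```python
import xml.etree.ElementTree as ET
from html import escape

# Hypothetical catalog record; field names and values are illustrative only.
record = {
    "title": "Emblemata Politica",
    "author": "Isselburg",
    "publisher": "HAB",
    "source": "Printed edition as described in the online catalog",
}


def build_tei_header(rec):
    """Build a minimal TEI header element from a catalog record (a dict)."""
    header = ET.Element("teiHeader")
    file_desc = ET.SubElement(header, "fileDesc")

    # Title statement: title and author of the electronic document.
    title_stmt = ET.SubElement(file_desc, "titleStmt")
    ET.SubElement(title_stmt, "title").text = rec["title"]
    ET.SubElement(title_stmt, "author").text = rec["author"]

    # Publication statement: who issues the electronic edition.
    pub_stmt = ET.SubElement(file_desc, "publicationStmt")
    ET.SubElement(pub_stmt, "publisher").text = rec["publisher"]

    # Source description: the printed source of the electronic text.
    source_desc = ET.SubElement(file_desc, "sourceDesc")
    ET.SubElement(source_desc, "bibl").text = rec["source"]
    return header


def dublin_core_meta(header):
    """Derive HTML meta elements with Dublin Core names from the header.

    The mapping below is an assumed, simplified one.
    """
    mapping = [
        ("DC.title", "fileDesc/titleStmt/title"),
        ("DC.creator", "fileDesc/titleStmt/author"),
        ("DC.publisher", "fileDesc/publicationStmt/publisher"),
        ("DC.source", "fileDesc/sourceDesc/bibl"),
    ]
    lines = []
    for name, path in mapping:
        node = header.find(path)
        if node is not None and node.text:
            content = escape(node.text, quote=True)
            lines.append('<meta name="%s" content="%s">' % (name, content))
    return lines


if __name__ == "__main__":
    tei_header = build_tei_header(record)
    print(ET.tostring(tei_header, encoding="unicode"))
    for line in dublin_core_meta(tei_header):
        print(line)
```

Keeping the header as the single source for both outputs means that a correction in the catalog record carries through to the XML document and to the HTML metadata alike.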
A TEI Header can be a very large and complex object, or it may be a very simple one. We made
use of a simple model, using the following header for the electronic edition:
The TEI Textblock <body>
Regardless of the type of source, the metadata for the electronic document has a uniform
structure in the header, consistent with the DTD. In the textblock <body> the structural features
of the original text are marked up. The components for recording and subject indexing of a
source differ according to its genre. That means that the documents of each genre first had to
be analyzed, so that the features of the genre could be defined and formalized as
DTD components. For emblem books, the following structure emerged: The basic unit of
our digitization is the entire emblem book, not the individual emblem. We digitize and structure
Illustration 1: Example of a TEI Header for Isselburg's Emblemata Politica, Herzog