iSchool Capstone

Apitext: An API for TEI-XML Transcriptions

Project tags:

archives & special collections

data curation

knowledge organization

Project poster

For over twenty years, the Text Encoding Initiative (TEI) has developed and maintained a set of encoding guidelines for representing texts from the humanities, social sciences, and linguistics in digital form, so that they can be preserved and shared. Built on Extensible Markup Language (XML), TEI is the generally accepted encoding model for the digital humanities. Because XML is extensible, TEI-XML files can vary widely in structure, which often makes them difficult to share and to make interoperable. Our Application Programming Interface (API) for TEI-XML documents addresses these challenges. It requires no prior programming experience to use, can be installed over standard File Transfer Protocol (FTP), and returns multiple interoperable views of a TEI-XML file, using a Uniform Resource Identifier (URI) as its method of query.
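
To illustrate the idea of requesting different views of one TEI-XML file through a URI, here is a minimal sketch in Python. It is not the Apitext implementation: the `view` query parameter, the view names (`xml`, `text`, `json`), the `render_view` function, the sample document, and the example.org address are all assumptions made for illustration. Only the TEI namespace is taken from the TEI guidelines.

```python
import json
import xml.etree.ElementTree as ET
from urllib.parse import urlparse, parse_qs

# The TEI P5 namespace; everything else in this sketch is hypothetical.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# A tiny, made-up TEI-XML transcription used only for demonstration.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Sample Letter</title></titleStmt>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <p>Dear friend,</p>
      <p>I write in haste.</p>
    </body>
  </text>
</TEI>"""


def render_view(tei_xml, uri):
    """Return one view of a TEI-XML document, chosen by the URI's query string.

    The 'view' parameter and its values are assumptions for this sketch.
    """
    query = parse_qs(urlparse(uri).query)
    view = query.get("view", ["xml"])[0]  # default to the raw XML view

    root = ET.fromstring(tei_xml)
    title = root.findtext(".//tei:titleStmt/tei:title", default="", namespaces=TEI_NS)
    # Collect each paragraph's text, collapsing internal whitespace.
    paragraphs = [
        " ".join("".join(p.itertext()).split())
        for p in root.iterfind(".//tei:body//tei:p", TEI_NS)
    ]

    if view == "xml":
        return tei_xml
    if view == "text":
        return "\n".join(paragraphs)
    if view == "json":
        return json.dumps({"title": title, "paragraphs": paragraphs})
    raise ValueError(f"unsupported view: {view!r}")


if __name__ == "__main__":
    # Hypothetical URIs: the same file, three different views.
    for v in ("text", "json", "xml"):
        print(render_view(SAMPLE_TEI, f"http://example.org/apitext/letter.xml?view={v}"))
```

In this sketch the URI alone determines what comes back, so a user can switch between views by editing the address rather than writing code.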

Project participants:

Guiyan Bai

Informatics

Michael Andrea

Informatics

Chris Sumption

Informatics