Project Digital ModelsDigital Corpus
Digital Corpus
The corpus of the texts is managed and exploited in a digital archive containing the formal representation of the texts leveraging the TEI/EpiDoc encoding schema, the de facto standard schema adopted in digital epigraphy. The corpus can be accessed through the DigItAnt platform.
TEI/EpiDoc is an international consortium which establishes guidelines and implements tools for digital encoding scholarly editions of ancient documents according to the Leiden conventions. In particular, it specifies a subset of the Text Encoding Initiative (TEI)'s standard vocabulary, providing elements for the edition of the texts (edition, translation, apparatus, commentary, bibliography) and for the description of the history (provenance, location, date, repository) and materiality (physical description) of the objects.
The XML format allows compatibility and interoperability with other text projects created according to the XML/TEI format and, thus, their availability for the research community.
TEI/EpiDoc is a versatile and customizable tool which can be easily adapted to specialised needs. ItAnt proposed some ad hoc solutions to manage the peculiarities presented by a language of
fragmentary attestation by describing more carefully its linguistic issues (cfr.
Murano et al. 2023).
Each text in the archive is enriched with shared and standard
metadata allowing for their accurate description, both as a linguistic object (text: language, alphabet, date, etc.) and as a material object (support: chronology, data of discovery, material, etc.).
In the perspective of the best possible data integration, ItAnt takes advantage of concepts coming from widely accredited vocabularies and gazetteers: The Art & Architecture Thesaurus (AAT) provided bt the J. Paul Getty Trust, gli iDAI. thesauri provided bt the Deutsches Archäologisches Institut, the EAGLE vocabularies, specifically designed for epigraphy, Pleiades and GeoNames.
ItAnt also provides the
Trismegistos IDs
to provide an additional strong integration to our data.
As part of the project, a Domain Specific Language (ItAnt DSL) has been developed with the aim of making the encoding of texts in XML according to the TEI/EpiDoc standard more familiar, transparent, and compact, and consequently faster and less prone to errors. Specifically, the application of the ItAnt DSL allows for the encoding of texts by compiling a plain and intuitive text, which is then transformed by the ItAnt DSL parser into XML with a proprietary schema. This XML, combined with another XML file containing recurring information—encoded in YAML—relevant to the specific text, is then transformed through an XSLT stylesheet into an XML file conforming to the TEI/EpiDoc standard.
References:
- Elliott, Tom, Gabriel Bodard, Elli Mylonas, Simona Stoyanova, Charlotte Tupman, e Scott Vanderbilt. 2022. «EpiDoc Guidelines: Ancient documents in TEI XML (Version 9)». 2022. https://epidoc.stoa.org/gl/latest/.
- Murano, Francesca, Valeria Quochi, Angelo Mario Del Grosso, Luca Rigobianco, e Mariarosaria Zinzi. c.s.. «Describing Inscriptions of Ancient Italy. The ItAnt Project and Its Information Encoding Process». Journal on Computing and Cultural Heritage
- Boschetti, Federico, Luca Rigobianco, e Valeria Quochi. 2024. «Domain-Specific Languages for Epigraphy: The Case of ItAnt». In Selected papers from the CLARIN Annual Conference, 191–202. Linköping Electronic Conference Proceeding.
Resources:
EpiDoc Editors: