1. Greek Papyri
The Ancient Greek civilization used numerous materials as support for their texts. Of these, the best preserved are epigraphic texts and the papyri. We can estimate the number of extant papyri to be well over 200,000, of which only a part have been published. They are either literary texts, subliterary texts (private letters, magical texts, etc.) or documentary texts, both public and private (contracts, legal texts, etc.).
Papyri, as we have them now, are very often fragmentary texts, usually poorly preserved. They have transmitted to our days words and expressions that we did not know from literature, and often must be interpreted within a context that can only be conjectured.
Naturally, the chronological extent of the documents testifies to numerous changes in writing practices, and, even more importantly, reflects numerous social changes that took place in the area during these more than a thousand years.
2. Digitization of Greek papyri
A considerable part of all the papyri that have already been scholarly edited have been digitized thanks to several projects, many of them gathered under the aegis of Papyri.info, using as encoding scheme EpiDoc, a subset of the TEI standard adapted to the edition of ancient texts.
Texts in this repository do not identify words separately (other than putting spaces among them), nor do they distinguish normally between words and word fragments, or words and numerals, nor do they lemmatize words.
There are currently more than 81,000 digitized papyri in the Papy.info repository. The site's software allows basic queries within the text of the papyri (without lemmatization) and the metadata that have been incorporated from the Heidelberger Gesamtverzeichnis der Griechischen Papyrusurkunden Ägyptens (HGV) project.
Callimachus (https://glg.csic.es/Callimachus/) is a new project currently in development at Spain’s CCHS-CSIC, that aims to make available to all users the metadata incorporated in the digital edition of the papyri from Papyri.info, as well as the result of some calculations and estimations about the formal content of the papyri based on such metadata and the data itself. Callimachus processes all digitized papyri and presents the following information using 55 categories.
1. All values and attributes meaningful for the study of Greek papyri contained in the XML markup.
2. It puts this information in relation to the data that about each papyrus has been collected by the HGV.
3. Identifies words and separates them from fragments and numerals.
4. Performs a series of calculations on the textual content of the papyrus, such as the number of words, letters per line, editorial corrections, scribal hands, etc.
The Callimachus number is a decimal number between zero and one that objectively indicates the state of preservation and readability of a document. A Callimachus number 0 indicates no preservation of the contents and 1 indicates that the entire contents of the papyrus have been preserved and are visible without problems. The Callimachus number is based on an algorithm that can be used to determine the state of preservation of any ancient document.
To calculate the Callimachus number we use the XML values and parameters of the document that indicate whether the text has been restored, whether it is fully or partially legible, etc. and perform some kind of calculations to estimate the percentage of missing text. All these calculations result in a matrix that assigns each letter (present or estimated according to the editors' judgment about the extent of the gaps) a value from 0 to 1 according to the criteria in the following table. The sum of all values is divided by the number of letters to obtain the Callimachus number:
5. Some applications of Callimachus
Callimachus can be used in papyrology research as well as in linguistics-related projects. One can use Callimachus to search papyri containing any specific feature, or a combination of features. For example, you can search for papyri containing any specific trait, or combination of traits. The use of Callimachus in combination with Polyphemus will allow to combine lexical information with all the data types of Callimachus.
If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.
July 25, 2022 - July 29, 2022
361 works by 945 authors indexed
Held in Tokyo and remote (hybrid) on account of COVID-19
Conference website: https://dh2022.adho.org/
Contributors: Scott B. Weingart, James Cummings
Series: ADHO (16)