École Pratique des Hautes Études – Université PSL, France
A number of papers have been given at the ADHO conferences and elsewhere regarding the modelling of writing, one of which is the Archetype model for palaeographical analysis (Diehr et al., 2018; Stokes, 2012; Stokes, 2018). As well as its implementation in the Archetype framework, this model has also been aligned to the CIDOC-CRM and CRMtex extension and then implemented in OWL (based on the Erlangen version 7.1.1), an extract of which is shown below. Although useful, this model (and indeed most others) treat the signs in written communication as distinct entities: they address the selection of a particular sign amongst a set of possible options, but they pay little or no explicit attention to the relationships between signs in a given piece of writing. In Structuralist terms, they focus primarily or exclusively on the paradigmatic aspect, but do not address the syntagmatic relations. This paper will therefore discuss some use cases and possible responses.
Extract of the Archetype model of handwriting aligned to CIDOC-CRM and CRMtex (Stokes, 2021; Stokes, forthcoming).
For Archetype, the system is designed primarily for palaeographical analysis such as the distribution of allographs by a given scribe (Brookes et al., 2015), but the choice of allographs is largely or even entirely determined by context such as the position in a word or the formality of the script (Morison, 1972), and the way in which a letter is written will often be influenced by that which immediately precedes or follows it, so an effective model should take this into account.
For the use of ‘allograph’, ‘idiograph’ and ‘graph’ in this discussion, see Stokes (2012) drawing on Davis (2007).
Another important use-case is meaning and disambiguation. As Robinson has pointed out (2009: 42–43), most examples of
i in medieval manuscripts do not comprise ‘a line with a dot over it’, but we recognise it anyway; however, this recognition depends in part on the linguistic context, and because we compare it with other nearby signs and adjust our expectations accordingly. This ambiguity is relatively common in practice, particularly in multigraphic contexts (for examples and discussion see e.g. Coueignoux, 1983: 61; Bugarski, 1993; Pierazzo, 2015; Stokes, 2018) and again suggests the need to model syntagmatic relationships.
Ambiguity in script: the word reads 'indecatur', but the
a is indistinguishable from two
cs (Pierazzo, 2015: 87). St Gallen Stiftsbibliothek MS 189, p. 76 (detail).
A third and very large case the function of signs. As Klinkenberg and Polis (2019: 68 ff.) and others have discussed, graphemic function is often relational, whether paradigmatic or syntagmatic. One example of a syntagmatic relational function is found in alphabetic systems, where letters often influence or determine the phonetic value of others. In English, for example, the pronunciation of the in
hate is partly determined by the following (contrast
hat). In some cases such as French, letters indicate the grammatical accord or function of words even when they are not pronounced (Klinkenberg and Polis, 2019: 85): contrast
il mange and
ils mangent. Punctuation and features such as underlining or italic also function syntagmatically, modifying the sense and/or pronunciation of that which follows and/or precedes them. Similarly, Anis (1983: 33–34) proposed graphematic supra-segmentals to describe elements in print such as italics when they are graphematically distinctive (contrast ‘dating
Beowulf’ with ‘dating Beowulf’). Determinants in writing-systems such as Egyptian hieroglyphics and Sumerian cuneiform also operate at this level, since the sign is usually at the start or end of a word but establishes the reading for that word, in terms of meaning and also often in sound. Finally, in some systems such as Japanese
kanji, the pronunciation and meaning of a sign can change dramatically depending on its context.
Figure 3: Determinatives in Egyptian hieroglyphics (Klinkenberg, 2005: 178). The two words have the same pronunciation, but different determinatives (‘human’ and ‘enemy’).
Finally, the cases so far all assume linear or ‘chronosyntactic’ writing, but one also finds ‘toposyntactic’ cases where meaning is determined in part by the spatial relationship between signs (Klinkenberg, 1996; Klinkenberg, 2005: esp. 190-95). This is particularly the case in Egyptian hieroglyphs (Polis, 2018) but can be found even in alphabetic scripts, tables being an obvious example (see Pierazzo and Stokes, 2011, for some others).
The question, then, is how to model these relationships. The most obvious is through graph or network models, where relationships can be explicitly recorded. For Archetype, this would comprise different types of relationships between instances of arch:Ontograph (essentially generalised graphemes: see Stokes, 2021). For instance an arch:Ontograph
modifies phonology of other arch:Ontographs; a determinant arch:Ontograph
modifies signification of the other arch:Ontographs in the same word; and so on. The example of “dating
Beowulf” is less clear, and here Anis’ concept of the suprasegmental grapheme (or arch:Ontograph) seems necessary; an arch:Suprasegmental_Ontograph then groups instances of arch:Ontographs and is itself associated with a given function or other indicator. A similar approach can be used to indicate other factors, such as the language of a text or even the script (or formality of) where that is relevant. However, this raises difficulties over the distinction between the graph, the allograph, and the grapheme. CRMtex is clear that the style of script (TX10 Style) is associated with a physical instance of writing (TX7 Written Segment: Murano et al., 2020: 20–22), and this seems correct insofar as graphemes are by definition independent of details such as style. However, CRMtex does not consider allographs or idiographs, and style (and other suprasegmental markers) are at least partly also associated with these. This then suggests several types of suprasegmental, each of which specifies syntagmatic relationships between letters but at different levels of ‘letter’: denotative for the grapheme, connotative for the allograph and graph, topological for the graph, and so on (see also Monella, 2020a; Monella, 2020b for denotation and connontation in this context).
The conceptual model proposed here is undoubtedly complex and probably too unwieldy for most practical purposes. It is also far from complete and has its own limitations. However, it is offered as a step towards a common conceptual model for handwriting that would allow for the mapping of different projects and datasets, and, perhaps more importantly, could provide a framework for analysis and comparison of different scripts in different alphabets and writing systems. In addition, such modelling has already proven important in the refining and increasing precision in the fraught subject of palaeographical terminology and descriptions (Stokes, 2015). To paraphrase Donald Knuth (1982: 5), the process of trying to produce as complete a conceptual model as possible is surely instructive to all concerned.
Bibliography
Anis, J. (1983). Pour une graphématique autonome.
Langue française,
59(1): 31–44 doi:10.3406/lfr.1983.5164.
Brookes, S., Stokes, P. A., Watson, M. and Matos, D. M. D. (2015). The DigiPal Project for European Scripts and Decorations. In Conti, A., Da Rold, O. and Shaw, P. A. (eds),
Writing Europe 500–1450: Texts and Contexts. Woodbridge: D. S. Brewer, pp. 25–58
www.jstor.org/stable/10.7722/j.ctt17mvjbg.9 (accessed 22 October 2021).
Bugarski, R. (1993). Graphic relativity and linguistic constructs. In Scholes, R. J. (ed),
Literacy and Language Analysis. London: Routledge, pp. 5–18.
Coueignoux, P. (1983). Approche structurelle de la lettre.
Langue française,
59(1): 45–67 doi:
10.3406/lfr.1983.5165.
Davis, T. (2007). The Practice of Handwriting Identification.
The Library: The Transactions of the Bibliographical Society,
8(3): 251–76.
Diehr, F., Gronemeyer, S., Prager, C., Wagner, E., Diederichs, K., Grube, N. and Brodhun, M. (2018). Organising the Unknown: A Concept for the Sign Classification of not yet (Fully) Deciphered Writing Systems Exemplified by a Digital Sign Catalogue for Maya Hieroglyphs.
Digital Humanities 2018 Book of Abstracts. Mexico City: Red de Humanidades Digitales A. C.
https://dh2018.adho.org/en/organising-the-unknown-a-concept-for-the-sign-classification-of-not-yet-fully-deciphered-writing-systems-exemplified-by-a-digital-sign-catalogue-for-maya-hieroglyphs/ (accessed 6 December 2021).
Klinkenberg, J.-M. (1996).
Précis de Sémiotique Générale. De Boeck Université.
Klinkenberg, J.-M. (2005). Vers une typologie générale des fonctions de l’écriture. De la linéarité à la spatialité.
Bulletin de La Classe Des Lettres,
1–6: 157–96.
Klinkenberg, J.-M. and Polis, S. (2019). Les fonctions de l’écriture : un modèle général Liège
https://orbi.uliege.be/handle/2268/241566 (accessed 27 November 2021).
Knuth, D. E. (1982). The Concept of a Meta-Font.
Visible Language,
16(1): 3–27.
Monella, P. (2020a). An Ontology for Digital Graphematics and Philology Wuppertal
http://www1.unipa.it/paolo.monella/wuppertal2020/ (accessed 2 October 2021).
Monella, P. (2020b). Un’ontologia della grafematica per la filologia digitale Siena
http://www1.unipa.it/paolo.monella/siena2020/ (accessed 2 October 2021).
Morison, S. (1972).
Politics and Script: Aspects of Authority and Freedom in the Development of Graeco-Latin Script from the Sixth Century B.C. to the Twentieth Century. (Edited and Completed by N. Barker, The Lyell Lectures, 1957). Oxford: Clarendon Press.
Murano, F., Felicetti, A. and Doerr, M. (2020).
Definition of the CRMtex: An Extension of CIDOC CRM to Model Ancient Textual Entities. Version 1.0. ICOM-CIDOC
http://www.cidoc-crm.org/crmtex/sites/default/files/CRMtex_v1.0_March_2020.pdf (accessed 18 September 2021).
Pierazzo, E. (2015).
Digital Scholarly Editing: Theories, Models and Methods. Farnham, Surrey, UK ; Burlington, VT: Ashgate.
Pierazzo, E. and Stokes, P. A. (2011). Putting the Text back into Context: A Codicological Approach to Manuscript Transcription. In Fischer, F., Fritze, C. and Vogeler, G. (eds),
Kodikologie und Paläographie im digitalen Zeitalter 2 - Codicology and Palaeography in the Digital Age 2, vol. 3. Norderstedt: Books on Demand (BoD), pp. 397–429.
Polis, S. (2018). The Functions and Toposyntax of Ancient Egyptian Hieroglyphs: Exploring the Iconicity and Spatiality of Pictorial Graphemes.
Signata(9): 291–363. doi:
10.4000/signata.1920.
Robinson, P. M. W. (2009). What Text Really is Not, and Why Editors have to Learn to Swim.
Literary and Linguistic Computing,
24(1): 41–52 doi:
10.1093/llc/fqn030.
Stokes, P. A. (2012). Modelling Medieval Handwriting: A New Approach to Digital Palaeography. In Meister, J. C., Schönert, K., Lomsché, B., Schernus, W., Schüch, L. and Stegkemper, M. (eds),
DH2012 Book of Abstracts. Hamburg: Hamburg University Press, pp. 382–85.
Stokes, P. A. (2015). Digital approaches to paleography and book history: some challenges, present and future.
Frontiers in Digital Humanities,
2(5). doi:
10.3389/fdigh.2015.00005
Stokes, P. A. (2018). Modelling Multigraphism: The Digital Representation of Multiple Scripts and Alphabets. In Palau, J. G. and Russell, I. G. (eds),
Digital Humanities 2018 Book of Abstracts. Mexico City: Red de Humanidades Digitales A. C., pp. 292–96.
Stokes, P. A. (2021).
The Archetype Ontology. doi:
10.5281/zenodo.5771601.
https://github.com/pastokes/archetype-ontology.
Stokes, P. A. (forthcoming). Aligning Archetype: Towards a Formal Model for a Transversal Palaeography.
COMSt Bulletin.
If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.
In review
Tokyo, Japan
July 25, 2022 - July 29, 2022
361 works by 945 authors indexed
Held in Tokyo and remote (hybrid) on account of COVID-19
Conference website: https://dh2022.adho.org/
Contributors: Scott B. Weingart, James Cummings
Series: ADHO (16)
Organizers: ADHO