Pundit. Semantic Annotation for Digital Humanities

poster / demo / art installation
Authorship
  1. 1. Giulio Andreini

    Net7 SRL, Internet Open Solutions

  2. 2. Francesca Di Donato

    Net7 SRL, Internet Open Solutions

  3. 3. Danilo Giacomi

    Net7 SRL, Internet Open Solutions

  4. 4. Enrico Giusti

    Net7 SRL, Internet Open Solutions

  5. 5. Raffaele Masotti

    Net7 SRL, Internet Open Solutions

Work text
This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.


Students and researchers are used to study on books and printed articles, underlining and taking notes on the text itself in the meantime.
Pundit (http://thepund.it/) allows to perform the same actions on any web page, being it an online magazine, a blog or a digital library. In fact, Pundit (Morbidoni et al., 2015) is an open source suite of applications that allows users to build semantic annotations with different levels of expressivity on web pages, collaborating in the meantime with others. The knowledge base created by annotations can be reused inside the Pundit applications or by external third party projects.
Pundit implements a client-server architecture and is made up of several components which interact with each other, but, if needed, are able to work independently:

Pundit Annotator: a simple and lightweight annotator tool that allows to highlight and comment text in web pages with ease. This tool is intended for general users, students and journalists.

Pundit Annotator Pro: an advanced tool for web annotation that allows to create semantic annotations (built by one or more triples) using text fragments, web pages, Cultural Heritage Objects or Linked Data entities. This tool is intended for scholars and researchers.

Pundit Annotations Manager: the Annotation Manager is a web application that allows users to review and manage their annotations as well as to export them in different formats.

Pundit Server: it is where all annotations are stored in a graph format. The data model of annotations is an extension of the Web Annotation Data Model standard (http://www.w3.org/TR/2014/WD-annotation-model-20141211/), defined by the Web Annotation Working Group of W3C.

Pundit Annotator Pro used in the project The European correspondence to Jacob Burckhardt
By using Pundit, users contribute to create a new knowledge layer on top of web pages, thus enriching the web of data.
Thanks to the possibility of sharing public annotations, adding comments to or evaluating existing annotations, Pundit can be portrayed as a collaborative platform, allowing the users to cooperate and share their results with workgroups, thus enabling revisions and a form of peer reviewing. Saved annotations are collected in notebooks that can be made public or private.
Users are able to visualize and manage their annotations in the so-called ‘Annotations Manager’. Besides browsing their notebooks and performing advanced research, there users can export their collected data in various standard formats. In this way, is possible to save copies of collected data, to be potentially exported in different systems, compliant with the Web Annotation standard.

The Pundit Annotations Manager is the application where users can manage their annotations
The Annotation Server implements a set of REST APIs, which allow to expose public resources through content negotiation. The knowledge base can be integrated in third party projects such as Digital Libraries

In particular, we integrated Pundit with Muruca, the open source Digital Library Framework developed by Net7 (http://muruca.org/en/).
and be used to build advanced research systems or advanced semantic data visualizations.

In these last years the software underwent continuous refactoring with the objective to make it more and more modular and adaptable in several contexts. Besides the graphical interface flexibility, with its pro and light modes, the client allows the use of custom vocabularies and ontologies.
Pundit can be customized for tailor made projects where requirements lie outside its standard features. Starting from open source code new features not present in the original version are developed.
The first version of Pundit was developed from 2010 within the EU Semlib project (http://cordis.europa.eu/result/rcn/57391_en.html). Then in 2012 the development continued in the context of the EU DM2E project (http://dm2e.eu/). In 2014 the StoM project (http://www.stom-project.eu) started with the aim of bringing Pundit in the market as a software-as-a-service platform. Specific features of Pundit are also under development within EU Europeana Sounds (http://www.europeanasounds.eu/) project.
Pundit is used for semantic annotations in research projects such as the ERC AdG LOOKINGATWORDS (http://cordis.europa.eu/project/rcn/102545_en.html) and ERC AdG EUROCORR (http://www.burckhardtsource.org) funded projects. In both applications it is used by teams of researchers to semantically enrich the corpus of text of their digital library.

Bibliography

Morbidoni, C. and Piccioli, A. (2015). Pundit 2.0.
Semantic Web journal, First published January 31, 2015: http://www.semantic-web-journal.net/system/files/swj1003.pdf.

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

Complete

ADHO - 2016
"Digital Identities: the Past and the Future"

Hosted at Jagiellonian University, Pedagogical University of Krakow

Kraków, Poland

July 11, 2016 - July 16, 2016

454 works by 1072 authors indexed

Conference website: https://dh2016.adho.org/

Series: ADHO (11)

Organizers: ADHO