Identifying relations between characters in Afrikaans, Tshivenḓa, and Xitsonga book

paper, specified "short paper"
  1. 1. Menno van Zaanen

    South African Centre for Digital Language Resources (SADiLaR)

  2. 2. Benito Trollip

    South African Centre for Digital Language Resources (SADiLaR)

  3. 3. Phathuthsedzo Ramukhadi

    South African Centre for Digital Language Resources (SADiLaR)

  4. 4. Respect Mlambo

    South African Centre for Digital Language Resources (SADiLaR)

Work text
This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.

The usefulness of computational linguistic tools, such as named entity recognition (NER) systems, in linguistic or literary studies of under-resourced languages is an area that is still relatively unexplored. We applied NER systems to one Afrikaans novel and two scanned dramas, one in Tshivenḓa and one in Xitsonga. Personal relations are identified through character name co-occurence in sentences and these relationships are visualized using Gephi, following the approach by Van de Ven et al. (2018). The research identified several practical problems: low quality OCR, low quality NER, limited amounts of NE and language specific issues.

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

In review

ADHO - 2020
"carrefours / intersections"

Hosted at Carleton University, Université d'Ottawa (University of Ottawa)

Ottawa, Ontario, Canada

July 20, 2020 - July 25, 2020

475 works by 1078 authors indexed

Conference cancelled due to coronavirus. Online conference held at Data for this conference were initially prepared and cleaned by May Ning.

Conference website:


Series: ADHO (15)

Organizers: ADHO