Information extraction across textual corpora: semi-automatic text-tagging workflow with Chinese local gazetteers

poster / demo / art installation
Authorship
  1. Calvin Yeh

    Max Planck Institute for the History of Science (Max Planck Institut für Wissenschaftsgeschichte)

  2. Sean Wang

    Max Planck Institute for the History of Science (Max Planck Institut für Wissenschaftsgeschichte)

  3. Shih-Pei Chen

    Max Planck Institute for the History of Science (Max Planck Institut für Wissenschaftsgeschichte)

Work text

Textual information extraction is necessary for many humanities projects. Since 2013, we have been developing “Local Gazetteers Research Tools” (LoGaRT), whose text-tagging component is designed for that purpose. This poster introduces the practical implementation of information extraction and organization in LoGaRT and discusses how the component could be applied to other corpora with consistent internal structures.


Conference Info

In review

ADHO - 2020
"carrefours / intersections"

Hosted at Carleton University, Université d'Ottawa (University of Ottawa)

Ottawa, Ontario, Canada

July 20, 2020 - July 25, 2020

475 works by 1078 authors indexed

Conference cancelled due to coronavirus. Online conference held at https://hcommons.org/groups/dh2020/. Data for this conference were initially prepared and cleaned by May Ning.

Conference website: https://dh2020.adho.org/

References: https://dh2020.adho.org/abstracts/

Series: ADHO (15)

Organizers: ADHO