Toward Three-Stage Automation of Annotation for Human Values

Emi Ishita, Satoshi Fukuda, Toru Oga, Douglas W. Oard, Kenneth R. Fleischmann, Yoichi Tomiura, An Shou Cheng

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

Prior work on automated annotation of human values has sought to train text classification techniques to label text spans with labels that reflect specific human values such as freedom, justice, or safety. This confounds three tasks: (1) selecting the documents to be labeled, (2) selecting the text spans that express or reflect human values, and (3) assigning labels to those spans. This paper proposes a three-stage model in which separate systems can be optimally trained for each of the three stages. Experiments from the first stage, document selection, indicate that annotation diversity trumps annotation quality, suggesting that when multiple annotators are available, the traditional practice of adjudicating conflicting annotations of the same documents is not as cost effective as an alternative in which each annotator labels different documents. Preliminary results for the second stage, selecting value sentences, indicate that high recall (94%) can be achieved on that task with levels of precision (above 80%) that seem suitable for use as part of a multi-stage annotation pipeline. The annotations created for these experiments are being made freely available, and the content that was annotated is available from commercial sources at modest cost.

Original languageEnglish
Title of host publicationInformation in Contemporary Society - 14th International Conference, iConference 2019, Proceedings
EditorsNatalie Greene Taylor, Caitlin Christian-Lamb, Bonnie Nardi, Michelle H. Martin
PublisherSpringer Verlag
Pages188-199
Number of pages12
ISBN (Print)9783030157418
DOIs
Publication statusPublished - 2019
Event14th International Conference on Information in Contemporary Society, iConference 2019 - Washington, United States
Duration: Mar 31 2019Apr 3 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11420 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Information in Contemporary Society, iConference 2019
Country/TerritoryUnited States
CityWashington
Period3/31/194/3/19

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Toward Three-Stage Automation of Annotation for Human Values'. Together they form a unique fingerprint.

Cite this