Keywords
Annotated Corpora, Annotation, User Study
Abstract
Manual annotation of large textual corpora can be cost-prohibitive, especially for rare and under-resourced languages. One potential solution is pre-annotation: asking human annotators to correct sentences that have already been annotated, usually by a machine. Another potential solution is correction propagation: using annotator corrections to dynamically improve to the remaining pre-annotations within the current sentence. The research presented in this paper employs a controlled user study to discover under what conditions these two machine-assisted annotation techniques are effective in increasing annotator speed and accuracy and thereby reducing the cost for the task of morphologically annotating texts written in classical Syriac. A preliminary analysis of the data indicates that pre-annotations improve annotator accuracy when they are at least 60% accurate, and annotator speed when they are at least 80% accurate. This research constitutes the first systematic evaluation of pre-annotation and correction propagation together in a controlled user study.
Original Publication Citation
Paul Felt, Eric Ringger, Kevin Seppi, Kristian Heal, Robbie Haertel, Deryle Lonsdale (2012). First Results in a Study Evaluating Pre-labeling and Correction Propagation for Machine-Assisted Syriac Morphological Analysis. In (Nicoletta Calzolari. Khalid Choukri, ThierryDeclerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Jan Odijk, and SteliosPiperidis, Eds.) Proceedings of the Eighth International Conference on Language Resources andEvaluation (LREC '12), European Language Resources Association (ELRA); pp. 878-885, ISBN 978-2-9517408-7-7.
BYU ScholarsArchive Citation
Lonsdale, Deryle W.; Felt, Paul; Ringger, Eric; Seppi, Kevin; Heal, Kristian; and Haertel, Robbie A., "First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis" (2012). Faculty Publications. 6863.
https://scholarsarchive.byu.edu/facpub/6863
Document Type
Conference Paper
Publication Date
2012
Publisher
European Language Resources Association
Language
English
College
Humanities
Department
Linguistics
Copyright Use Information
https://lib.byu.edu/about/copyright/