The need for annotated corpora in a variety of different types of research grows constantly. Unfortunately creating annotated corpora is frequently cost-prohibitive due the number of person-hours required to create the corpus. This project investigates one solution that helps to reduce the cost of creating annotated corpora through the use of a new user interface which includes a specially built framework and component for annotating part-of-speech information and the implementation of a dictionary. This project reports on a user study performed to determine the effect of dictionaries with different levels of coverage on a part-of-speech annotation task. Based on a pilot study with thirty-three participants the analysis shows that a part-of-speech tag dictionary with greater than or equal to 60% coverage helps to improve the time required to complete the part-of-speech annotation task while maintaining high levels of accuracy.
College and Department
Humanities; Linguistics and English Language
BYU ScholarsArchive Citation
Carmen, Marc A., "Utilizing Human-Computer Interactions to Improve Text Annotation" (2010). Theses and Dissertations. 2143.
part-of-speech annotation, user study, CCASH, active learning, cost-reduction