College and Department
Physical and Mathematical Sciences; Computer Science
BYU ScholarsArchive Citation
Packer, Thomas L., "Scalable Detection and Extraction of Data in Lists in OCRed Text for Ontology Population Using Semi-Supervised and Unsupervised Active Wrapper Induction" (2014). All Theses and Dissertations. 4258.
information extraction, data, ontology, conceptual modeling, ontology population, grammar induction, wrapper induction, hidden Markov model, HMM, regular expression, regex, OCR, plain text, OCRed text document, list, active learning, unsupervised active learning, document analysis and recognition, historical document