Journal of Undergraduate Research

Keywords

extracted data fields, genealogical microfilm, ASCII

College

Physical and Mathematical Sciences

Department

Computer Science

Abstract

A wealth of information is locked up in millions of microfilm documents. Extracting and organizing this enormous amount of data is an overwhelming task. Automated data extraction techniques offer the ability to capture this information rapidly and systematically so that it can be stored and queried in databases. The complete process consists of five steps: convert the handwritten text to ASCII, record the text’s location, classify the text, create records from the classified text, and store the records. My research focused on developing data extraction techniques for automating the recognition and creation of genealogical records from the extracted text.
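
The later steps of the process described above (classifying located text, creating records, and storing them) can be illustrated with a minimal Python sketch. It is not the method developed in this research: the `Token` type, the rule-based `classify` function, the row-grouping rule in `build_records`, and the `person` table are all hypothetical names and simplifications chosen for illustration, and the handwriting-to-ASCII step is assumed to have already produced the tokens.

```python
import re
import sqlite3
from dataclasses import dataclass


@dataclass
class Token:
    text: str  # ASCII text assumed to come from the recognition step
    x: int     # horizontal position on the page image
    y: int     # vertical position (row) on the page image


def classify(token):
    """Toy rule-based classifier; the labels are illustrative assumptions."""
    if re.fullmatch(r"\d{4}", token.text):
        return "year"
    if token.text.istitle():
        return "name"
    return "other"


def build_records(tokens):
    """Group tokens that share a row into one (name, year) record."""
    rows = {}
    # Sort by row, then left to right, so names are joined in reading order.
    for t in sorted(tokens, key=lambda t: (t.y, t.x)):
        rows.setdefault(t.y, []).append(t)
    records = []
    for y in sorted(rows):
        names = [t.text for t in rows[y] if classify(t) == "name"]
        years = [t.text for t in rows[y] if classify(t) == "year"]
        records.append((" ".join(names), years[0] if years else None))
    return records


def store(records, conn):
    """Store the records in a hypothetical single-table schema."""
    conn.execute("CREATE TABLE IF NOT EXISTS person (name TEXT, year TEXT)")
    conn.executemany("INSERT INTO person VALUES (?, ?)", records)
    conn.commit()


# Example input: tokens as they might arrive, in no particular order.
tokens = [
    Token("Smith", 120, 10), Token("John", 10, 10), Token("1850", 300, 10),
    Token("Mary", 10, 40), Token("Jones", 120, 40), Token("1852", 300, 40),
]
conn = sqlite3.connect(":memory:")
records = build_records(tokens)
store(records, conn)
```

Once stored, the records can be queried like any other database table, for example with `conn.execute("SELECT name FROM person WHERE year = '1852'")`. A real system would need a far more robust classifier and a grouping rule that tolerates skewed or irregular rows.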
