digital libraries, microfilm, image processing, automated processing
Large-scale, multi-terabyte digital libraries are becoming feasible due to decreasing costs of storage, CPU, and bandwidth. However, costs associated with preparing content for input into the library remain high due to the amount of human labor required. This paper describes the Digital Microfilm Pipeline – a sequence of image processing operations used to populate a large-scale digital library from a “mountain” of microfilm and reduce the human labor involved. Essential parts of the pipeline include algorithms for document zoning and labeling, consensus-based template creation, reversal of geometric transformations and Just-In-Time Browsing, an interactive technique for progressive access of image content over a low-bandwidth medium. We also suggest more automated approaches to cropping, enhancement and data extraction.
Original Publication Citation
W. A. Barrett, L. Hutchison, D. Quass, H. Nielson, and D. Kennard, "Digital Mountain: From Granite Archive to Global Access," IEEE Proceedings, International Workshop on Document Image Analysis for Libraries (DIAL 24), pp. 14-121, Palo Alto, CA, January, 24.
BYU ScholarsArchive Citation
Barrett, William A.; Hutchison, Luke A. D.; Kennard, Douglas J.; Nielson, Heath E.; and Quass, Dallan, "Digital Mountain: From Granite Archive to Global Access" (2004). All Faculty Publications. 462.
Physical and Mathematical Sciences
© 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Copyright Use Information