Keywords
Fourier-Mellin transform, tabular document images, transformations
Abstract
A new technique is presented for quickly identifying global affine transformations applied to tabular document images, and to correct for those transformations. This technique, based on the Fourier-Mellin transform, is used to register (align) a set of tabular documents to each other. Each component of the affine transform is handled separately, which dramatically reduces the total parameter space of the problem. This method is robust, and deals with all components of the affine transform in a uniform way. The Fourier-Mellin transform is also extended to handle shear, which can approximate a small amount of perspective distortion, and to not need Blackman windowing for document images. In order to limit registration to foreground pixels only, and to eliminate Fourier “edge effects”, a novel, locally adaptive foreground-background segmentation algorithm is introduced, based on the median filter. An original method is also presented for automatically obtaining blank document templates from a set of registered document images. Finally, image registration is demonstrated as a tool for compression of document images which share the same template.
Original Publication Citation
L. Hutchison and W. A. Barrett, "Fast Registration of Tabular Document Images Using Fourier-Mellin Transform," IEEE Proceedings, International Workshop on Document Image Analysis for Libraries (DIAL 24), pp. 253-269, Palo Alto, CA, January, 24.
BYU ScholarsArchive Citation
Barrett, William A. and Hutchison, Luke A. D., "Fast Registration of Tabular Document Images Using the Fourier-Mellin Transform" (2004). Faculty Publications. 463.
https://scholarsarchive.byu.edu/facpub/463
Document Type
Peer-Reviewed Article
Publication Date
2004-01-01
Permanent URL
http://hdl.lib.byu.edu/1877/2616
Publisher
IEEE
Language
English
College
Physical and Mathematical Sciences
Department
Computer Science
Copyright Status
© 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Copyright Use Information
http://lib.byu.edu/about/copyright/