Demonstration: A Robust Web Data-Extraction Technique With High Recall and Precision

Keywords

data extraction, unstructured web documents, database technology

Abstract

Our demo shows how to extract and structure data found in data-rich, unstructured, multiple-record Web documents. Users may either apply pre-built extraction applications or build and apply their own. The demo is significant because it (1) attacks an important data-centric problem and (2) uses database technology to produce good results with minimal effort.

Original Publication Citation

D.M. Campbell, Y. Ding, D.W. Embley, K. Hewett, D.L. Jackman, S.S. Jeffries, Y.S. Jiang, D. Lewis, S.W. Liddle, D.W. Lonsdale, Y.-K. Ng, A.L. Peacock, D.J. Seer, R.D. Smith, S.H. Yau, M. Xu, and L. Xu (1999). A Robust Web Data-Extraction Technique With High Recall and Precision. BYU CS Data Extraction Group Technical Report (11 pages).

BYU ScholarsArchive Citation

Lonsdale, Deryle W.; Campbell, D. M.; Ding, Yihong; Embley, David W.; Hewett, K.; Jackman, D. L.; Jeffries, S. S.; Jiang, Y. S.; Lewis, D.; Liddle, Stephen W.; Ng, Y. K.; Peacock, A. L.; Seer, D. J.; Smith, R. D.; Yau, S. H.; Xu, M.; and Xu, L., "Demonstration: A Robust Web Data-Extraction Technique With High Recall and Precision" (1999). Faculty Publications. 6816.
https://scholarsarchive.byu.edu/facpub/6816

Document Type

Report

Publication Date

1999

Publisher

Brigham Young University

Language

English

College

Humanities

University Standing at Time of Publication

Associate Professor

Copyright Use Information

https://lib.byu.edu/about/copyright/
