Journal of Undergraduate Research
Keywords
attribute extraction, web page clustering, web people searches, person names
College
Physical and Mathematical Sciences
Department
Computer Science
Abstract
The disambiguation of person names in web people searches is a long standing problem within the semantic search community. A query such as the name “Henry Eyring” would produce thousands of results with references to more than one entity with that same name. In order to mitigate this problem, person-related attributes such as birth dates are extracted and used to group pages that refer to the same entity. The level of confidence that the pages are correctly grouped together is thus directly dependent upon the level of confidence that the person-related attributes were correctly extracted and properly associated to the correct entity.
Recommended Citation
Park, Joseph and Embley, Dr. David
(2014)
"Attribute Extraction for Web Page Clustering in Web People Searches,"
Journal of Undergraduate Research: Vol. 2014:
Iss.
1, Article 1173.
Available at:
https://scholarsarchive.byu.edu/jur/vol2014/iss1/1173