Journal of Undergraduate Research

Keywords

attribute extraction, web page clustering, web people searches, person names

College

Physical and Mathematical Sciences

Department

Computer Science

Abstract

The disambiguation of person names in web people searches is a long-standing problem within the semantic search community. A query for a name such as “Henry Eyring” can return thousands of results that refer to more than one entity with that name. To mitigate this problem, person-related attributes such as birth dates are extracted and used to group pages that refer to the same entity. The confidence that pages are correctly grouped therefore depends directly on the confidence that the person-related attributes were correctly extracted and associated with the correct entity.
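As a rough illustration of the idea described in the abstract, the sketch below groups pages by an extracted birth year. It is not the method used in this work: the regular expression, the function names `extract_birth_year` and `group_pages_by_birth_year`, and the page texts in the usage note are all assumptions made up for the example, and a real extractor would need to handle far more phrasing and resolve which person an attribute belongs to.

```python
import re
from collections import defaultdict

# Hypothetical pattern: the word "born" followed, within the same sentence
# and at most 40 characters later, by a four-digit year.
BIRTH_YEAR = re.compile(r"\bborn\b[^.]{0,40}?\b(\d{4})\b", re.IGNORECASE)


def extract_birth_year(text):
    """Return the first birth year found in the text, or None if none is found."""
    match = BIRTH_YEAR.search(text)
    return match.group(1) if match else None


def group_pages_by_birth_year(pages):
    """Group page identifiers by extracted birth year.

    `pages` maps a page identifier to its text. Pages with no extracted
    year are collected under the key None rather than being guessed at.
    """
    groups = defaultdict(list)
    for page_id, text in pages.items():
        groups[extract_birth_year(text)].append(page_id)
    return dict(groups)
```

For instance, calling `group_pages_by_birth_year` on a few invented page snippets that mention "born on February 20, 1901" and "born May 31, 1933" would place them in separate groups, while a page with no birth date lands in the `None` group. A wrong extraction puts a page in the wrong group, which is why grouping confidence depends on extraction confidence.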
