•  
  •  
 

Journal of Undergraduate Research

Keywords

web documents, word similarity, fuzzy set information retrieval model, IR

College

Physical and Mathematical Sciences

Department

Computer Science

Abstract

In the 2007 MEG proposal, I specified the following Internet problems to be solved using a Fuzzy set information retrieval (IR) model: (i) detecting plagiarism, which is the act of using another’s words or ideas as one’s own, (ii) filtering junk emails, which are undesirable because junk emails waste valuable resources and time and include offensive content in addition to the monetary cost that reaches billions of dollars per year, the bill that is paid by public users, and (iii) identifying spam Web pages that include contents that are useless to the Web users. We have successfully found the solution to each of the proposed problems to be solved, and our claim is supported by the articles published in the academic journals and conference proceedings (see Section 4 for the list of published work). In addition, Maria Soledad (Sole) Pera, one of my former M.S. students and my current Ph.D. student, has been actively involved in this MEG project and assisting me throughout the past 2 1⁄2 years in this research work and its publications. Sole has gained valuable experience working on the various research problems involved in this mentoring project, successfully solved the problems, and published a number of articles that are the results of the research work conducted in this funded project.

Share

COinS