IDENTIFYING TOPICS FOR WEB DOCUMENTS THROUGH FUZZY ASSOCIATION LEARNING

CHOOCHART Haruechaiyasak; MEI-LING Shyu; SHU-CHING Chen

doi:10.1142/S1469026802000609

Back

IDENTIFYING TOPICS FOR WEB DOCUMENTS THROUGH FUZZY ASSOCIATION LEARNING

Journal article

Peer reviewed

IDENTIFYING TOPICS FOR WEB DOCUMENTS THROUGH FUZZY ASSOCIATION LEARNING

CHOOCHART Haruechaiyasak, MEI-LING Shyu and SHU-CHING Chen

International journal of computational intelligence and applications, Vol.2(3), pp.277-285

2002-09

DOI: https://doi.org/10.1142/S1469026802000609

Abstract

Due to the explosive growth of available information on the World Wide Web (WWW), users have suffered from the information overload. To alleviate this problem, there is a need for an intelligent tool to help the users screening and filtering for interesting and useful information. In this paper, a method of automatically identifying topics for Web documents via a classification technique is proposed. Topic identification can be applied as a filtering tool for recommender systems to prune down the number of documents to within some particular topics. We adopt the fuzzy association concept as a machine learning technique to classify the documents into some predefined categories or topics. Our approach is compared to the vector space model with the cosine coefficient using the data sets collected from three different Web portals: Yahoo!, Open Directory Project and Excite. The results show that our approach yields higher classification accuracy compared to the vector space model.

Metrics

5 Record Views

Details

Title: IDENTIFYING TOPICS FOR WEB DOCUMENTS THROUGH FUZZY ASSOCIATION LEARNING
Creators: CHOOCHART Haruechaiyasak - University of Miami
MEI-LING Shyu - University of Miami
SHU-CHING Chen - Florida International University
Publication Details: International journal of computational intelligence and applications, Vol.2(3), pp.277-285
Academic Unit: CoE - Electrical & Computer Engineering; College of Engineering
Language: English
Resource Type: Journal article
Record Identifier: 991031725883302976