基于模糊相关的Web文档分类方法
A Classification Approach Based on Fuzzy Related Technology for Web Document
【Affiliation】 College of Information Science and Technology, Hainan University
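【Summary】 1 Introduction. With the continuing spread and development of Internet applications, the WWW has become a vast distributed information space. However, the interesting and useful information that Internet users need is often buried in this sea of information and is hard to find, a phenomenon known as information "overload". People urgently need to be able to quickly ... from the WWW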
【Abstract】 Owing to the explosive growth of information available on the WWW, users often find themselves overwhelmed by the sheer volume of information that might be interesting or useful to them. To alleviate this problem, an intelligent tool is needed to help users screen and filter for interesting and useful information. Web documents tend to have unpredictable characteristics, i.e. differences in length, quality and authorship. Motivated by these fuzzy characteristics, this paper adopts the fuzzy related technology to classify Web documents into a predefined set of categories. The experimental results show that this approach yields higher classification accuracy than the vector space model.
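- 【Proceedings】 Proceedings of the 21st China Database Academic Conference (Technical Reports volume)
- 【Conference】 21st China Database Academic Conference
- 【Conference date】 2004-10-14
- 【Venue】 Xiamen, Fujian, China
- 【Classification code】 TP391.1
- 【Organizer】 Database Technical Committee, China Computer Federation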