节点文献
基于用户兴趣分析的网页生命周期建模
Modeling Lifetime of Web Pages Based on User Interest Analysis
【摘要】 网页在其生命周期内的活跃程度会随时间发生变化。有的网页只在特定的阶段有价值,此后就会过时。从用户的角度对网页的生命周期进行分析可以提高网络爬虫和搜索引擎的性能,改善网络广告的效果。利用一台代理服务器收集的网页访问量信息,我们对网页的生命周期进行了研究,给出了用户兴趣演变的模型。这个模型有助于更好地理解网络的组织与运行机理。
【Abstract】 The activeness of a web page varies during its lifetime.Some pages are valuable only in a specific period,and then become obsolescent.Web page lifetime analysis from users’ perspective is important to enhance the performance of web crawlers and search engines,and to improve the efficiency of web advertising.With page view data collected by a proxy server,we were able to perform large scale analysis in web page lifetime.A model is given to describe user interest evolution based on an experiment conducted with the page view data of more than 36000000 web pages for two months.The model is the foundation to better understand how the web is organized and operates.
【Key words】 computer application; Chinese information processing; user behavior analysis; web page lifetime; web log mining;
- 【文献出处】 中文信息学报 ,Journal of Chinese Information Processing , 编辑部邮箱 ,2008年02期
- 【分类号】TP182;TP393.092
- 【被引频次】8
- 【下载频次】335