Free and Latest article publishing for websites and ezines!


Web Usage Mining Based Logs

With the rapid development of the Internet applications, there is a sharply increased demand on information services via the Internet. Too many choices and information people faced can not be digested. Such phenomenon is known as information overload and information lost. How to use effective methods to find the useful information and make it valid is a challenge. Then, the research on Web mining is brought forward and is becoming a hot spot.Data mining based on Web Log is a main aspect of Web mining. How to make the users find the information they are interested in more quickly and expediently is the aim of every Web site. If the site's capability is improved, it will attract more users to visit it. Whether the site can provide the individuation service is an important factor to estimate it. Through data mining on Web log, we can find the user's traversal patterns. It will help us to improve the site's structure and provide the better service to the users.This paper researches how to mine the users' usage patterns based on Web log and researches collaborated recommendation based enquiry log of search engine. The main works are following:1. The Web usage mining was studied completely, including data collection, data preparation, pattern discovery, pattern analysis and applications.2. The thesis presents the basic idea and process of the algorithm of the hard K-means clustering and the fuzzy K-means clustering. It studies the fuzzy K-means clustering parameters, and detailed discusses the clustering question of center initialization. Meanwhile we presents a improved validity function that can be used effectively to find the optimal number of centers.3. The paper proposes an improved Web user and URL clustering method, the algorithm effectively integrated visit time and the number of hits. Experiment by the real server log confirmed the effectiveness of the algorithm.4. This dissertation researches topic-attentive ranking algorithm which used in search engine recommendation. Use query log to analysis keyword clustering and presents an improved similarity function and certificates it by artificial data.Finally, this dissertation summarized the author's works and discussed the future.

Recommended Articles from the Networks Category:

Most Viewed Articles in the Networks Category:

  1. Design and Realization of Task Scheduling Algorithm in Grid Environment
  2. Research on Trust Model in P2P Based on Improved Chord Protocol
  3. Design and Implement of VPN with Dynamic Password
  4. The Research of Task Scheduling in Computational Grid Based on DCG3A
  5. Research on Scheduling Disciplines with Self-Similar Traffic Input
  6. Research on Extension of Network Management Functions and System Realization
  7. Research of Incentive Model in P2P Network
  8. Research on Grid Resource Scheduling Model with Three-level and Algorithm
  9. Research on the Replica Selection Strategies in Spatial Information Grid
  10. Research on IP Multicast Access to SUPANET Multicast Management


© 2004-2009 Information-Technology-Articles.com - All Rights Reserved Worldwide.