This document discusses techniques for analyzing web server logs to better understand user behavior and improve website design and content. It describes clustering web logs to group similar users and page content, using association rule mining to discover relationships between pages users visit, and identifying sequential patterns in user visits to predict future behavior. The goal is to gain insights that allow websites to evolve and optimize their presence online through adaptive changes.
This document discusses techniques for analyzing web server logs to better understand user behavior and improve website design and content. It describes clustering web logs to group similar users and page content, using association rule mining to discover relationships between pages users visit, and identifying sequential patterns in user visits to predict future behavior. The goal is to gain insights that allow websites to evolve and optimize their presence online through adaptive changes.
This document discusses techniques for analyzing web server logs to better understand user behavior and improve website design and content. It describes clustering web logs to group similar users and page content, using association rule mining to discover relationships between pages users visit, and identifying sequential patterns in user visits to predict future behavior. The goal is to gain insights that allow websites to evolve and optimize their presence online through adaptive changes.
Adaptive Web Presence and Evolution through Web Log Analysis
traffic. It would be beneficial if the website can FUTURE TRENDS
cache these small size documents in memory. For A e-commerce websites, the highly populated items With the explosive growth of the Internet and ever should be arranged to allow fast access because increasing popularity of e-commerce, privacy is be- these items will compose over 50% of the total coming a sensitive topic that attracts many research requests. These insights are helpful for the better efforts. How to make sure the identity of an individual design and adaptive evolution of the web sites. is not compromised while effective web log analysis 2. Clustering and Classification – Techniques to can be conducted is a big challenge. An initiative called group a set of items with similar characteristics Platform for Privacy Preference (P3P) is ongoing at the and/or to map them into predefined classes. World Wide Web Consortium (W3C). How to analyze In the web log analysis domain, there are two the web log online and make timely decision to update major clusters of interest to discover: web usage and evolve the web sites is another promising topic. clustering and web pages clustering. Clustering of web usage can establish the groups of users that exhibit similar browsing behaviors and infer CONCLUSION user demographic information. Such knowledge is especially useful for marketing campaign in An effective web presence is crucial to enhance the e-commerce applications and personalized web image of a company, increase the brand and product presence. On the other hand, clustering analysis awareness, provide customer services, and gather of the web pages can discover the web pages with information. The better understanding of the web’s related content. This is useful for the development topology and user access patterns, along with model- of Internet search engine. Classification can be ing and designing efforts, can help to develop search accomplished through well developed data mining engines and strategies to evolve the web sites. Web logs algorithms including Bayesian classifier, k-nearest contain potentially useful information for the study of neighbor classifier, support vector machines, and the effectiveness of web presence. The components so on (Duda et al., 2000). of web log analysis are described in this chapter. The 3. Associative Rules – Associative rules mining approaches and challenges of acquisition and prepro- is to find interesting associations or correlations cessing of web logs are presented. Pattern discovery among large data sets. In the web log mining techniques including statistical analysis, clustering and domain, one is interested in discovering the classification, associative rules and sequential pattern implications or correlations of user access pat- are discussed in the context of web log analysis towards terns. For example, users who access page A also adaptive web presence and evolution. visit page B; customers who purchase product C also purchase product D. A typical associative rule application is market basket analysis. This REFERENCES knowledge is useful for effective web presence and evolution by laying out user friendly hyper Adamic, L.A. and Huberman, B.A. (1999). The Nature links for easier access. It can help for e-commerce of Markets in the World Wide Web. Computing in web site to promote products as well. Economics and Finance, no. 521. 4. Sequential Patterns – The sequential patterns mining attempts to find inter-transaction patterns Albert, R., Jeong, H. and Barabási, A.L. (1999). The such that the presence of one event is followed Diameter of the World Wide Web. Nature, 401:130- by another (Mannila et al., 1995, Srikant and 130. Agrawal, 1996). In the context of web log analy- Asllani A. and Lari A. (2007). Using genetic algo- sis, the discovery of sequential patterns helps to rithm for dynamic and multiple criteria web-site predict user visit patterns and to target certain optimizations. European Journal of Operational Re- groups based on these patterns. search, 176(3): 1767-1777.
Content 1) Introduction 2) Brief Review of The Work Done in The Related Field 3) ) Noteworthy Contributions 4) Proposed Methodology 5) Expected Outcome 6) References