Author : Dr. S.Vijayarani 1
Date of Publication :16th February 2018
Abstract: The World Wide Web has a vast amount of information resources and services. Every website is comprised of a number of web pages. Whenever, a user access the websites, the server saved this information in web log files which is a plain text (.txt) file. Web log files contain unnecessary and noisy data. It can be preprocessed using web mining techniques. Data preprocessing is the process of selecting standardized data from the original log files. Data cleaning, user identification, session identification and path completion are different stages of data preprocessing. Log files contain the information about the users like user name, visiting path, the path traversed, time stamp, page last visited, success rate, user agent and URL. The log files are stored in different locations like web server, web proxy server and the client browser. This paper has provided a detailed review of web log files; i.e. concepts of web server data, application server data, application level data, web server logs, log file parameter, types of log file format, various locations of web log files and the different types of web log files. In addition to this, we also surveyed the existing research works and given the information about how web log files are used in web usage mining research.
Reference :
-
- Roop Ranjan, Sameena Naaz and Neeraj Kaushik, “Web Miner: A Tool for Discovery of Usage Patterns From Web Data”, International Journal on Computer Science and Engineering (IJCSE), Vol. 5 No. 05 May 2013, pp. 286-293, ISSN: 0975-3397.
- The W3C Technology Stack; “World Wide Web Consortium”, Retrieved April 21, 2012.
- Arvind K Sharma, P. C. Gupta,“Enhancing the Performance of the Website through Web Log Analysis and Improvement”, International Journal of Computer Science and Technology (IJCST) Vol. 3, Issue 4, Oct-Dec 2012.
- Huiping Peng, “Discovery of Interesting Association Rules Based on Web Usage Mining”, International Conference 2010.
- Cooley, R.,“Web Usage Mining: Discovery and Application of Interesting Patterns from Web data”, 2000.
- Liu, H., et al., “Combined mining of Web server logs and web contents for classifying user navigation patterns and predicting user‟s future requests”, Data and Knowledge Engineering, 2007, Vol. 61, Issue 2, pp. 304- 330.
- M. Spiliopoulou and L. C. Faulstich. Wum, “A web utilization miner”. In Proc. of EDBT WorkshopWebDB98, Valencia, Spain, March 1998.
- M. Malarvizhi, S. A. Sahaaya Arul Mary, “Preprocessing of Educational Institution Web Log Data for Finding Frequent Patterns using Weighted Association Rule Mining Technique”, European Journal of Scientific Research ISSN 1450-216X Vol.74 No.4 (2012), pp. 617- 633.
- Sanjay Madria, Sourav s Bhowmick, w. -k ng, e. P. Lim, “Research Issues in Web Data Mining”.
- A. Jebaraj Ratnakumar, “An Implementation of Web Personalization Using Web Mining Techniques”, Journal Of Theoretical And Applied Information Technology, 2005 - 2010 JATIT
- Tsuyoshi, M and Saito, K., “Extracting User‟s Interest for Web Log Data”, IEEE 2006, pp. 343-346, ISBN: 0-7695-2747-7