Author : Ramkrishna R. Gaikwad 1
Date of Publication :13th March 2018
Abstract: Nowadays the growth of World Wide Web has better a lot with more assumption. Large amount of text, multimedia files, images website documents were present in the web and it is still increasing in its forms. Education Search engine has become an important daily network application tool to search information. Data mining is the form of extracting data present in the internet. We propose an Education Search Engine in two-stage technique, namely Smart Crawler, for efficient gathering deep web interfaces. To achieve more accurate results for a focused crawl, Smart Crawler ranks websites links to prioritize highly relevant result in websites link rankings. In the second stage, Web Usages mining in web scraping is a method for extracting textual characters from screens so that they could be analyzed. Web scraping is the process of collecting information from the World Wide Web. The results showed that the smart crawler and scarper can realize the high-efficient and flexible data collection function, and laid the foundation for Web data mining. This efficiently retrieves web data mining interface from large-scale sites and achieves higher.
Reference :
-
- ZHENG Guojun2, JIA Wenchao1, SHI Jihui2, SHI Fan1, ZHU Hao2, LIU Jiang “Design and Application of Intelligent Dynamic Crawler for Web Data Mining,“2017 Ninth IEEE International Conference on e-Business Engineering.
- Syed Md. Galib, Ajay Shah, Md. Motiur Rahman, Maitri Debnath ” Clustered and Smarter Web mining using Semantic Web,” 2015 Ninth IEEE International Conference on eBusiness Engineering.
- Simona Bernardi, Ra´ul Pirac´es Alastuey, Raquel Trillo-Lado “Web Content Mining Techniques Tools & Algorithms – A Comprehensive Study” International Journal of Computer Trends and Technology (IJCTT) – volume 4 Issue 8–August 2013
- Jingtao Shang, Jianjun Lin, Van Qin, Bo Li, MengmengWu, “Design of Analysis System for Documents Based on Web Crawler” 2016 2nd IEEE International Conference on Computer and Communications
- Simona Bernardi, Ra´ul Pirac´es Alastuey, Raquel Trillo-Lado, “Using Process Mining and Model-driven Engineering to Enhance Security of Web Information Systems” 2017 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW)
- Deepak Kumar Mahto, Lisha Singh, “A Dive into Web Scraper World” 2016 International Conference on Computing for Sustainable Global Development (INDIACom)
- Suvarn Sharma, Amit Bhagat, “Data Preprocessing Algorithm for Web Structure Mining” 2016 Fifth International Conference on Eco-Friendly Computing and Communication Systems (ICECCS-2016).
- Srinaganya.G., Dr.J.G.R.Sathiaseelan, “A Technical Study on Information Retrieval using Web Mining Techniques” IEEE Sponsored 2nd International Conference on Innovations in Information, Embedded and Communication systems (ICIIECS) 2015.