International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Science , Computer Science and Engineering

Volume5,March 2018,

Topic : Enhanced Indexing and Scraping for Educational Search Engine using Web Usage Mining

Authors:Ramkrishna R. Gaikwad || Mansi Bhonsle

Abstract:Nowadays the growth of World Wide Web has better a lot with more assumption. Large amount of text, multimedia files, images website documents were present in the web and it is still increasing in its forms. Education Search engine has become an important daily network application tool to search information. Data mining is the form of extracting data present in the internet. We propose an Education Search Engine in two-stage technique, namely Smart Crawler, for efficient gathering deep web interfaces. To achieve more accurate results for a focused crawl, Smart Crawler ranks websites links to prioritize highly relevant result in websites link rankings. In the second stage, Web Usages mining in web scraping is a method for extracting textual characters from screens so that they could be analyzed. Web scraping is the process of collecting information from the World Wide Web. The results showed that the smart crawler and scarper can realize the high-efficient and flexible data collection function, and laid the foundation for Web data mining. This efficiently retrieves web data mining interface from large-scale sites and achieves higher..

Keywords: Information extraction, web crawler, web usage mining, web scraping

Download Paper


DOI: 01.1617/vol5/iss3/pid07425


Related Articles

Wormhole Attack Prevention and Detection Approaches in Mobile Ad hoc Networks: A Survey.

Authors: Avinash Singh || Ram Singar Verma

Doi : 01.1617/vol4/iss03/pid34591

Volume4 ,March 2018.

Developing a GIS Platform for Tourism marketing and promotion In Nigeria: Case Study Of Bauchi State

Authors: Abubakar Siddiq Ango || Abdulsalam Waheed Abdulfatah, Idrissa Djibo

Doi : 01.1617/vol3/iss09/pid91783

Volume3 ,September 2018.

Decentralized Trust Management and Trust Worthiness of Cloud Environments

Authors: Qhubaib Syed || Syed Afzal Ahmed,Syed Abdul Haq

Doi : 01.1617/vol4/iss4/pid21386

Volume4 ,April 2018.

Brain Controlled Chess Based on Virtual Reality Control for Paralyzed Patients

Authors: Sindhura Rao || Ashwini S,Kusuma Mohanchandra

Doi : 01.1617/vol4/iss4/pid82074

Volume4 ,April 2018.

Multilevel Authentication System

Authors: Nichita Silva Lobo || Beverly Rodrigues,Pavni Alluri,Prathibha Singh,Nicole Alvares

Doi : 01.1617/vol4/iss4/pid82135

Volume4 ,April 2018.

Stock Prediction Using Clustering And Regression Techniques

Authors: Shalini Lotlikar || Megha Ainapurkar

Doi : 01.1617/vol4/iss4/pid39708

Volume4 ,April 2018.

Brain Tumor Detection using Image Segmentation

Authors: Siddhi N. Nerurkar ||

Doi : 01.1617/vol4/iss4/pid02845

Volume4 ,April 2018.

Video Mining using Query by Example

Authors: Rosebud Valadares ||

Doi : 01.1617/vol4/iss4/pid89542

Volume4 ,April 2018.

Fuzzy Opinion Mining for Product Recommender System

Authors: Aarti Bandodker ||

Doi : 01.1617/vol4/iss4/pid03652

Volume4 ,April 2018.

Aadhaar and Server based Electoral system

Authors: Yugansh Garg || Sakshi Mishra

Doi : 01.1617/vol4/iss4/pid69053

Volume4 ,April 2018.

.

Editor-in-Chief

Editor Image


Dr. Allon Guez
Professor, Drexel University,
USA


View more


IMPACT FACTOR: 4.890

ISSN(Online):2394-2320

Google Scholar Profile

Thomson Reuters ID : q-6288-2016.
ORCiD Research ID : 0000-0001-9540-6799

All Issues


ACCEPTANCE RATIO

ACCEPTANCE RATIO: 28.69%
ARTICLES PUBLISHED:0521
PAPER RECEIVED:01730
Journal Code : IJERCSE
Electronic ISSN : 2394-2320
Impact Factor : 4.890
Frequency : monthly
Contact : info@ijercse.com


IFERP OTHER JOURNALS


Subscribe

           Email:

SOCIAL MEDIA