Author : Gurdas Singh 1
Date of Publication :11th April 2018
Abstract: Currently, Search engines only provide URL links for search queries. Crawling strategy adopted by most search engines only search on HTML keywords and index the pages, but semantic web retains most rich information in RDF files and crawlers don’t index the RDF. In this work, we deal with the problem and design a smart crawler which can retrieve semantic information for keyword queries. In addition to retrieving the information, the proposed solution also focus on ranking the semantic information. Ordinarily, a covetous framework is used to pick the terms that enlarge the new returns each cost unit. We comprehended that not each record is square with while selecting the request to cover them. Broad reports can be secured by various requests, paying little respect to how the inquiries are picked. In like manner, the criticalness of a record is then again with respect to its size. Our further examination of this issue finds that the noteworthiness of the record depends not simply on the amount of the terms it contains, furthermore the sizes of those terms.
Reference :
-
- Kevin Chen-Chuan Chang, Bin He, and Zhen Zhang. ―Toward large scale integration: Building a metaquerier over databases on the web,‖ In CIDR, 2005, pages 44–55.
- Denis Shestakov. ―Databases on the web: national web domain survey,‖ Proc. of the 15th Symposium on International Database Engineering & Applications, ACM 2011, pages179–184.
- Denis Shestakov and Tapio Salakoski. ―Host-IP clustering technique for deep web characterization,‖ Proc. of the 12th International Asia-Pacific Web Conference (APWEB), IEEE 2010, pages 378–380.
- Denis Shestakov and Tapio Salakoski. ―On estimating the scale of national deep web,‖ Database and Expert Systems Applications, Springer 2007, pages 780–789.
- Shestakov Denis. ―On building a search interface discovery system,‖ Proc. of the 2nd international conference on Resource discovery, Lyon France, 2010 Springer, pages 81–93.
- Luciano Barbosa and Juliana Freire. ―Searching for hidden-web databases,‖ WebDB, 2005, pages 1–6.
- Luciano Barbosa and Juliana Freire. ―An adaptive crawler for locating hidden-web entry points,‖ Proc. of the 16th international conference on World Wide Web, ACM 2007, pages 441–450.
- Soumen Chakrabarti, Martin Van den Berg, and Byron Dom. ―Focused crawling: a new approach to topic-specific web resource discovery,‖ Computer Networks, 31(11): 1999, 1623–1640