International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Open Access Journal

ISSN : 2394-2320 (Online)

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

Open Access Journal

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

ISSN : 2394-2320 (Online)

Call For Paper : Vol 11, Issue 03, March 2024

Improve the Efficiency of Real World Entity Set Using Progressive Methods

Author : Dipalee A. More ¹ Prof. Nitin N. Patil ²

Date of Publication :17th August 2017

Abstract: Database contains very large data sets, where various duplicate records are present. The duplicate records occur when data entries are stored in a uniform manner in the database, resolving the structural heterogeneity problem. Maximum the gain of the overall process within time availability by reporting most results much earlier than traditional approaches. Detection of duplicate records is difficult to find and it takes more execution time. The authors described various techniques used to find duplicate records in the database but there are some issues in these techniques. To address this, Progressive Algorithms have been said, for that, which significantly increases the efficiency of finding duplicates, if the execution time is limited and improves the quality of records. The authors will combine base paper progressive approaches with scalable approaches for duplicate detection to deliver results even faster

Reference :

1. Rohit Ananthakrishna, Surajit Chaudhuri, and Venkatesh Ganti, “Eliminating fuzzy duplicates in data warehouses,” In Proceedings of the International Conference on Very Large Databases (VLDB), 2002
2. Rohan Baxter, Peter Christen, and Tim Churches. “A comparison of fast blocking methods for record linkage,” In SIGKDD Workshop on Data Cleaning, Record Linkage and Object Consolidation, 2003.
3. Mikhail Bilenko, Beena Kamath, and Raymond J. Mooney, “Adaptive blocking: Learning to scale up record linkage,” In Industrial Conference on Data Mining (ICDM), 2006.
4. Peter Christen, “Towards parameter-free blocking for scalable record linkage,” Technical Report TR-CS-07-03, The Australian National University, August 2007.
5. S. E. Whang, D. Marmaros, and H. GarciaMolina, “Pay-as-you-go entity resolution,”IEEE Trans. Knowl. Data Eng., vol. 25, no. 5, pp. 1111–1124, May 2012.
6. Ashwini V. Lake, Lithin K, “A study and survey on various progressive duplicate detection mechanisms,” in IJRET: International Journal of Research in Engineering and Technology, vol. 05 pp. 2319-1163, Mar. 2016.
7. Ahmed K. Elmagarmid, Panagiotis G. Ipeirotis, and Vassilios S. Verykios, “Duplicate record detection: A survey,” IEEE Transactions on Knowledge and Data Engineering (TKDE), 19, 2007.
8. Mauricio A. Hernandez and Salvatore J. Stolfo, “The merge/purge problem for large databases,” In Proceedings of the ACM International Conference on Management of Data (SIGMOD), 1995
9. Mauricio A. Hernandez and Salvatore J. Stolfo, “Real-world data is dirty: Data cleansing and the merge/purge problem,” Data Mining and Knowledge Discovery, 2(1), 1998.
10. Alvaro E. Monge and Charles Elkan, “An efficient domain-independent algorithm for detecting approximately duplicate database records, ” In Proceedings of the Workshop on Research Issues on Data Mining and Knowledge Discovery, 1997.
11. Sven Puhlmann, Melanie Weis, and Felix Naumann, “XML duplicate detection using sorted neighborhoods,” In Proceedings of the International Conference on Extending Database Technology (EDBT), 2006.

Recent Article

● REDUCING LEAKAGE POWER IN CMOS CIRCUITS

● Review on Glucose Biosensors

● Wireless Battery Charger: Architecture

● Review of Quantum Cryptography

● The Air Taxi: A Futuristic Travel System Nearing Reality

● A Review of Recent Research on Nanosensors

● A Research Paper on the Fundamentals of Plastic Welding

● Review Article on Internet of Things in Smart Cultivation

● An Article on Understanding Quantum Theory

● REDUCING LEAKAGE POWER IN CMOS CIRCUITS

● Review on Glucose Biosensors

● Wireless Battery Charger: Architecture

● Design of Location Based Tracking Device

● Review of Quantum Cryptography

● The Relationship between Law and Morals

● Case study of Crisis of Leadership in Fast Changing World

● Drishti - IoT based Blind Stick

● Batch Normalization and Its Optimization Techniques: Review

● Mitigating denial of service attacks in search engines using OLSR Method

● A Exposure Based Technique With Dwt To Enhance The Image

● Advanced communication Through Fleshredtacton

● Hybrid Model for Image Classification and Analysis of Best Enhancement Technique

● A Modernization Approach to Software Engineering

● Encountering Evidence of a Node with Proximity based Mobile Opportunistic Social Network

● Hydrogeomorphology and Watershed management studies of Kosigi Mandal, Kurnool District, Andhra Pradesh, India using Remote Sensing and GIS

● Instant Medical Insurance Claims Through Aadhaar Based Authentication

● Automatic Follow-up Actions for Medical Treatment

● Avoiding Intrusion and Privacy Protection for Cloudlet-based Medical Data Sharing

● Image Processing: A New Step to Security

● Simple & Secure Mechanism for establishing connection between D2D Communication in 5G Scenario

● Dr. House - Warehouse Manager

● An Improved Genetic Algorithm in C for Knapsack Problem

● An Investigation on Nearest Neighbor Search Techniques

● Preventing Obfuscated Malware via Differential Fault Analysis

● Simulation of Special Mathematical Functions

● Parallel Computing of Fractional Integral Operators

● Comparative Analysis of Heart Disease Dataset using KNN and Decision Tree Classification

● Enhanced Forensics Enabled Cloud through Secured Logging as a Service

● Reversible Data hiding by Reversible image Transformation Algorithm for Encrypted Images

● Dictionary Learning Arrangement for Multi-Label Image Annotation

● â€œAn automated brain tumour detection and severity analysis using ANNâ€

● â€œMalicious Misbehavior Activity Detection Using Probabilistic Threat Propagation in Network Securityâ€

● Improve Performance of Crawler Using K-means Clustering

● Health Care in Smart Cities: A Survey based on IoT data analytics

● A Multi-Authority Access Control System in Cloud Computing Using Network Security

● DWT & SVD Based Watermarking Scheme for Copyright Protection In Medical Images

● Meet-O-Mania (Meeting Scheduler)

● Color and Texture Based Image Retrieval Based on Quadtree Segmentation Technique

● â€œSecurity privacy preserving for content leaksâ€

● â€œImplementation of file level and block level deduplication and detecting attacks in cloud environment.â€

● Reversible Data Hiding using Context free Reversible Grammar

● Hiding Sensitive Association Rule Constructed from Table

● A Report on Business Intelligence The Role of Data Analysis and Data Mining in Contemporary Organizations and The Ethical Implications of Collecting, Storing and Using Data

● Hemorrhages Detection in Retinal Color Fundus Image

● A Survey on interactive multi-label segmentation using cellular automata

● Multiclass Sentiment Classification On Product Reviews

● An Intuitive Architecture for Next Generation Digital Personal Assistants

● â€œHybrid technique of Image Encryption to Enhance Securityâ€

● A Novel Approach For Image Security

● Theoretical Channel allocation for SDRs in Smart Grid neighbourhood area network

● Virtualization Concept and Live Virtual Machine Migration

● Searching Comparatively Better Result From Agglomerative algorithm

● Improve the Efficiency of Real World Entity Set Using Progressive Methods

● Exploring Big Data Analytics for Satellite Imagery Data Using Hadoop Technique

● Cluster Based Data Centric Trust Management in VANET

● Fault Tolerance Approach in Distributed Sensor Networks using Genetic Algorithm

● Hybrid Crypto-System Approach use in Secure Intrusion Detection System for MANET

● Automatic Image Annotation-A Proposed Method

● Automated Tormenting Recognition in light of Semantic-Enhanced Marginalized Stacked Denoisng Auto-Encoder

● Social Media Mining for Price Prediction of Stock Market Using Map Reduce Framework

● Detection of Red Lesions For Diabetic Retinopathy In Telemedicine Context