International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Open Access Journal

ISSN : 2394-2320 (Online)

Monthly Journal for Computer Science and Engineering

An Efficient Sentence Level Clustering using Hierarchical and Frequent Pattern Mining

Authors: Dr. P. Kalyani ¹, N. Saranya ²

Date of Publication: 7th February 2017

Abstract: Clustering is the process of grouping or aggregating data items. Sentence clustering is used mainly in applications such as document classification and categorization, automatic summary generation, and document organization. It plays a vital role in text processing and is used in text mining activities. Cluster sizes may vary from one cluster to another. Traditional clustering algorithms have several problems when clustering an input dataset, such as cluster instability, complexity, and sensitivity. To overcome these drawbacks, this paper proposes a hierarchical hybrid frequent pattern mining algorithm and a Hierarchical Fuzzy Relational Eigenvector Centrality based Clustering Algorithm (HFRECCA) for clustering sentences. The content of text documents has a hierarchical structure, and many terms in a document relate to more than one theme; hence HFRECCA is a useful algorithm for natural language documents. The frequent pattern mining algorithm is an influential algorithm for mining frequent itemsets for Boolean association rules. It uses a "bottom-up" approach in which frequent subsets are extended one item at a time (a step known as candidate generation), and groups of candidates are tested against the data.
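
The bottom-up candidate generation described in the abstract can be illustrated with a minimal Apriori-style sketch. This is not the authors' implementation: the function name `frequent_itemsets`, the `min_support` parameter (an absolute count), and the toy sentences below are all assumptions made for illustration, and each sentence is treated simply as a set of terms.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support_count} for every itemset that occurs in at
    least `min_support` transactions (hypothetical helper, Apriori-style)."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count how many transactions contain each single item.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(current)

    k = 2
    while current:
        # Candidate generation: join frequent (k-1)-itemsets into k-itemsets,
        # keeping only candidates whose (k-1)-subsets are all frequent.
        candidates = set()
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) == k and all(
                frozenset(sub) in current for sub in combinations(union, k - 1)
            ):
                candidates.add(union)

        # Test the group of candidates against the data.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_support}
        result.update(current)
        k += 1

    return result


# Toy input (assumed): each sentence reduced to its set of terms.
sentences = [
    {"cluster", "sentence", "text"},
    {"cluster", "sentence"},
    {"cluster", "text"},
    {"sentence", "text", "mining"},
]
result = frequent_itemsets(sentences, min_support=2)
for itemset, support in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), support)
```

With this toy input, the three single terms other than "mining" and all three pairs among them meet the support threshold, while the three-term candidate is generated but rejected when tested against the data. How such frequent term sets would feed into HFRECCA is not specified in the abstract, so the sketch stops at the mining step.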
