International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Web Document Clustering Algorithm and Similarity Measure

Author : Ms.S.M.Durge ¹ Mr.Y.M.Kurwade ² Dr.V.M.Thakare ³

Date of Publication :20th April 2017

Abstract: The Clustering is an unsupervised method to divide data into disjoint subsets with high intra-cluster similarity and low inter-cluster similarity. Most of the approaches perform web documents clustering, i.e., they assign each object to precisely one of a set of clusters. Objects in one cluster are similar to each other. The similarity between objects is based on a measure of the distance between them.This works well when clustering the compact and well-separated groups of data, but in many situations, clusters are different at rerun. This proposed method usek-means++ algorithm,is capable of identifying problem by spreading the initial centers evenly and improves performance

Reference :

Will Updated soon

Monthly Journal for Computer Science and Engineering

Monthly Journal for Computer Science and Engineering

Call for Paper

Indexing

Recent Article