An Updated Clustering-Based Model for Document Clustering
Clustering is applied to partition a set of users into k clusters using the k-means algorithm, and
association mining is used to generate rules. A cluster is a collection of data objects that are similar to one another
within the same cluster and dissimilar to the objects in other clusters. A cluster of data objects can be treated collectively as
one group in many applications. Document clustering is an extension of traditional clustering. This paper presents an
introduction to document clustering and an updated document clustering technique. Document clustering helps
optimize search, which makes it useful in many real-world applications such as search engines.
Index Terms: Data Mining, K-Means, Clustering, SimHash Technique.
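
As a rough illustration of the k-means step named above, the following minimal sketch groups a few short documents into k clusters. It is not the updated technique proposed in this paper. The term-frequency representation, the Euclidean distance, the seeding of centroids from the first k documents, and the names `tf_vectors` and `kmeans` are all assumptions made for this example.

```python
import math
from collections import Counter


def tf_vectors(docs):
    """Map each document to a term-frequency vector over a shared vocabulary."""
    vocab = sorted({word for doc in docs for word in doc.lower().split()})
    vectors = []
    for doc in docs:
        counts = Counter(doc.lower().split())
        vectors.append([float(counts[word]) for word in vocab])
    return vectors


def kmeans(points, k, max_iterations=20):
    """Plain k-means; returns one cluster label per point."""
    # Assumption: seed centroids with the first k points (deterministic, not robust).
    centroids = [list(p) for p in points[:k]]
    labels = None
    for _ in range(max_iterations):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the clustering has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


docs = [
    "data mining finds patterns in data",
    "search engines rank web pages",
    "mining data for association patterns",
    "web search engines index pages",
]
labels = kmeans(tf_vectors(docs), k=2)
print(labels)  # prints [0, 1, 0, 1]
```

Documents that share vocabulary end up in the same cluster: here the two data mining documents form one group and the two search engine documents form the other. A practical system would likely use a weighted representation and a more careful initialization.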