The challenges of Clustering techniques for High Dimensional Data
Labels: Computers , Data Mining , Data Warehousing , Free Download , Paper Presentations , 0 comments
Abstract: Clustering analysis divides data into groups (clusters) for the purposes of summarization or improved understanding. For example,cluster analysis has been used to group related documents for browsing, to find genes and proteins that have similar functionality,or as a means of data compression. While clustering has a long history and a large number of clustering techniques have been developed in statistics, pattern recognition, data mining,and other fields, significant challenges still remain.In this paper provide a short introduction to cluster analysis, and then focus on the challenge of clustering high dimensional data . Cluster analysis is a challenging task and there are a number of well-known issues associated with it , e.g., finding clusters in data where there are clusters of different shapes ,sizes and density or where the data has lots of noise and outliers. These issues become more important in the context of high dimensionality data sets. Clustering depends critically on density and distance (similarity) ,but these concepts become increasingly more difficult to define as dimensionality increases.
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi3RhUTWe-qKfRqbo8D2KCVV1iAOWUtm4nmqfoSJHauy0xVJwAZ67ApQl-o2RH0bixPCQtZCGmrGzBoXCWPfb9yZPGcknwmmL-tDj9uJzOICYaERWyl5xCqk7n-hcG9-XpMQteU8JYt3dr1/s400/download_button.png)
0 Responses to "The challenges of Clustering techniques for High Dimensional Data"
Post a Comment