For very high-dimensional data, subspace clustering algorithms try to find subspaces underlying the data and then find clusters within each of those subspaces. The final clustering is obtained by combining the clusters from each subspace. The hope is that the representation in each subspace removes much of the overall clutter and gives a clearer view of the data that is easier to cluster. Which of the best-known algorithms have shown good empirical results?
There's this survey paper from 2004. Does it answer your questions?
Yeah, but it's a bit dated, so I was curious to know whether there have been other, more recent approaches.
(Jul 15 '10 at 21:33)
spinxl39
Do you mind describing a bit more how these subspaces are chosen?
It is usually based on some localized search for relevant feature dimensions. More details can be found in the paper Alexandre has listed below.
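To make "localized search for relevant feature dimensions" a bit more concrete, here is a rough sketch of one way it could look. This is not the method from the survey or from any particular algorithm, just a simplified illustration: take the k nearest neighbours of a point and keep the dimensions along which that neighbourhood is much tighter than average. The toy data, the function names, and the `k` and `ratio` parameters are all made up for the example.

```python
from statistics import pvariance


def make_toy_data():
    """Two hypothetical clusters in 2-D, each tight along a different axis."""
    # Cluster A: almost constant in dimension 0, spread out in dimension 1.
    cluster_a = [(0.001 * (i % 3), i / 20) for i in range(20)]
    # Cluster B: spread out in dimension 0, almost constant in dimension 1.
    cluster_b = [(5 + i / 20, 5 + 0.001 * (i % 3)) for i in range(20)]
    return cluster_a + cluster_b


def relevant_dimensions(data, index, k=6, ratio=0.5):
    """Dimensions along which the k nearest neighbours of data[index] are tight."""
    point = data[index]
    # Rank all other points by squared Euclidean distance in the full space.
    others = sorted(
        (i for i in range(len(data)) if i != index),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(point, data[i])),
    )
    neighbourhood = [point] + [data[i] for i in others[:k]]
    # Per-dimension variance inside the neighbourhood.
    local_var = [pvariance(column) for column in zip(*neighbourhood)]
    # Keep dimensions whose local spread is well below the average spread
    # (the 0.5 default is an arbitrary assumption, not a recommended value).
    cutoff = ratio * sum(local_var) / len(local_var)
    return [d for d, v in enumerate(local_var) if v < cutoff]


data = make_toy_data()
print(relevant_dimensions(data, 10))  # a point from cluster A
print(relevant_dimensions(data, 30))  # a point from cluster B
```

On this toy data the first call should pick dimension 0 and the second dimension 1, so each point ends up with its own local subspace. Real algorithms differ in how they define the neighbourhood and the relevance threshold, and full-space neighbours become less reliable as the number of noisy dimensions grows.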