For very high dimensional data, subspace clustering algorithms attempt to find subspaces underlying the data and then find clusters in each such subspace. The final clustering is obtained by combining the clusters in each subspace. Basically, the hope is that individual representations in each subspace will remove the overall clutter, and lead to clearer view of the data which will be easily clusterable.

What are some of the best known algorithms that have shown empirically good results?

asked Jul 15 '10 at 21:24

spinxl39's gravatar image

spinxl39
3698114869

Do mind describing a bit more how these subspaces are chosen?

(Jul 15 '10 at 21:26) Joseph Turian ♦♦

It is usually based on some localized search for relevant features dimensions. More details can be found in the paper Alexandre has listed below.

(Jul 15 '10 at 21:34) spinxl39

One Answer:

There's this survey paper from 2004. Does it answer your questions?

answered Jul 15 '10 at 21:27

Alexandre%20Passos's gravatar image

Alexandre Passos ♦
2554154278421

Yeah, but it's a bit dated so I was curious to know if there have been other more recent approaches..

(Jul 15 '10 at 21:33) spinxl39
Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.