0
2

I'm trying to understand fuzzy k-modes algorithm (look mainly at page 3) in order to implement it. I'm stuck at the calculation of cluster centers they said as shown in the pics alt text alt text

I need to know whether the following is true or false and please correct me

In order to get the center of a cluster we need to:

  1. for each category in a variable calculate the sum of membership value for all the point, that the category belongs to, to this cluster
  2. the highest obtained value to be set as the category of the center
This question is marked "community wiki".

asked Oct 18 '11 at 10:26

Marwiya's gravatar image

Marwiya
1113

edited Oct 18 '11 at 10:46

To summarize; I need to understand how to find the cluster centers that satisfies the algorithm conditions, as I couldn't understand clearly what does the above mathematical expression means.Please help me :)

(Oct 18 '11 at 17:36) Marwiya

It looks like you pick the most popular value for each attribute. Does this make sense?

(Oct 18 '11 at 22:42) Alexandre Passos ♦

Wl-i is the membership value of the point i to the cluster l. points will have this structure A(1000,001011,000000001) 1000 means the first variable of A has the first category on ... B(0110,100000,101010000). suppose we're working on the 1st cluster so and we knew that W1-A=0.7 and W1-B=0.6 So will we take for each category the membership of the whole point? and make the sum for each point and chose the highest???

(Oct 19 '11 at 07:59) Marwiya

@Alexandre Passos any help?

(Oct 21 '11 at 11:36) Marwiya
1

This really looks like it's setting the value for each attribute in each cluster to the most popular value among all items assigned to that cluster. At least that's what eq (9) seems to be saying: this value has the highest sum among all other values.

Does this make sense with the rest of the paper?

(Oct 21 '11 at 11:40) Alexandre Passos ♦

@Alexandre Passos what do u mean by popular ? the one which's the most frequent or having the highest membership to the cluster ?

(Oct 21 '11 at 16:51) Marwiya

I didn't follow all of it but if I remember correctly fuzzy clustering mean assigning each variable some degree of membership to every cluster ( as opposed to hard clustering, which assign each object to a specific cluster). I think the algorithm first assign to each variable some degree of membership, and then determine the cluster center according to equation 9. ( the most popular value among all items)

(Oct 30 '11 at 14:14) Dov
showing 5 of 7 show all

One Answer:

I guess I found the answer that's mean that for the set of points in the cluster we do the sum of membership value for similar attributes domain for those points , the one which has the highest result is taken to be the attribute of the cluster center

answered Oct 30 '11 at 14:03

Marwiya's gravatar image

Marwiya
1113

edited Oct 30 '11 at 14:04

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.