What does sumd method in k-means clustering function exactly calculate?

Question

Onur Kapucu 2018년 5월 8일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate

댓글: Onur Kapucu 2018년 5월 8일

I am doing basic experiments with kmeans function. As a real simple example, say that I have a data set of 4 items with 1 attribute and this attribute is their value:

Data=[1;2;3;4];

If I want to split this data set into 2 clusters I should get one centroid in 1.5 and another in 3.5:

[idx,C,sumd]=kmeans(Data,2)
C =     
1.5000
3.5000

and I get it. However to my understanding sumd in this case should be:

abs(1-1.5)+abs(2-1.5) or  abs(3-3.5)+abs(4-3.5)
ans =
       1

but I am getting sumd as:

sumd =
      0.5000
      0.5000

for both clusters. Instead of getting 1's for both.

My question is what exactly does sumd calculate?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Ameer Hamza 2018년 5월 8일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319322

편집: Ameer Hamza 2018년 5월 8일

MATLAB Online에서 열기

If you look at the documentation of kmeans(), you will know that it uses the square of the Euclidean distance, by default. So you should calculate it like this

abs(1-1.5).^2+abs(2-1.5).^2 or  abs(3-3.5).^2+abs(4-3.5).^2
ans = 
  0.5 (both cases)

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Onur Kapucu 2018년 5월 8일

Thanks

댓글을 달려면 로그인하십시오.

Answer 2

the cyclist 2018년 5월 8일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319323

It's because the default distance metric used is the squared Euclidean distance (for minimization, and reporting). See the Distance input parameter.

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Onur Kapucu 2018년 5월 8일

Thanks

댓글을 달려면 로그인하십시오.

What does sumd method in k-means clustering function exactly calculate?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (1개)

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

What does sumd method in k-means clustering function exactly calculate?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (1개)

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기