Unsupervised clustering of categorical data

조회 수: 8 (최근 30일)
Daniel Guignard
Daniel Guignard 2021년 11월 23일
답변: Pratyush Roy 2021년 12월 1일
Hi everyone,
I wanted to cluster a time serie dataset which has 30 timepoints and more than 50'000 rows. The dataset is categorical (from 1 to 6) which represent different categories.
The problem with my current clustergram method using the euclidian distance metrics, is that it will cluster the category 5 closer to 6. I don't want that, those categories are not somehow related. How is it possible to remove this bias in the clustering?
Hope my question is clear, thanks for your further help!
  댓글 수: 2
Image Analyst
Image Analyst 2021년 11월 23일
Could be clearer if you attached a .mat file with your table, as many rows as will fit into 5 MB (attachment size limit).
Daniel Guignard
Daniel Guignard 2021년 11월 23일
sure, here is the matrix

댓글을 달려면 로그인하십시오.

답변 (1개)

Pratyush Roy
Pratyush Roy 2021년 12월 1일
Hi Daniel,
The link here might be helpful for clustering categorical or non-numeric data.
Hope this helps!

카테고리

Help CenterFile Exchange에서 Cluster Analysis and Anomaly Detection에 대해 자세히 알아보기

제품


릴리스

R2021b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by