MATLAB clustering technique with textual data

Brian on 12 Dec 2016
Answered: mizuki on 20 Dec 2016
Hi, I am trying to figure out the best way to cluster numeric information (stock returns) using a series of textual information. For instance, let's say I have stock returns for 10 sectors that I'd like to cluster into 3 distinct groups. My first thought was to use the k-means clustering algorithm from the Statistics and Machine Learning Toolbox; however, it doesn't accept textual information as a descriptor.
Please advise.
Example data set
Industry, Return
Financials,2%
Consumer Disc,3%
Consumer Staples,4.5%
Energy,1%
Health Care,1.5%
Industrials,2.2%
Info Tech,3.7%
Materials,4.8%
Telecom,-2%
Utilities,-1%
1 Comment
Brian on 16 Dec 2016
Any ideas on this from statistical experts?

Answers (1)

mizuki on 20 Dec 2016
Convert the textual data to a categorical array, which reduces each text label to a discrete category.
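
One possible reading of this suggestion, as a minimal sketch rather than a definitive recipe: store the sector names as a categorical variable, run kmeans on the numeric returns only (kmeans expects a numeric matrix), and use the categorical labels to report which sectors land in each group. The variable names (industry, ret, T) are made up for illustration, and the choice to cluster on returns alone is an assumption, since every sector appears exactly once in the example data.

```matlab
% Example data from the question (returns in percent).
industry = {'Financials'; 'Consumer Disc'; 'Consumer Staples'; 'Energy'; ...
            'Health Care'; 'Industrials'; 'Info Tech'; 'Materials'; ...
            'Telecom'; 'Utilities'};
ret = [2; 3; 4.5; 1; 1.5; 2.2; 3.7; 4.8; -2; -1];

% Keep the text labels as a categorical variable next to the returns.
T = table(categorical(industry), ret, 'VariableNames', {'Industry', 'Return'});

% Assumption: cluster on the numeric returns only, into 3 groups.
rng(0);                                        % reproducible random starts
[idx, centers] = kmeans(T.Return, 3, 'Replicates', 10);
T.Cluster = idx;

% Report the sectors assigned to each cluster.
for k = 1:3
    members = cellstr(T.Industry(T.Cluster == k))';   % 1-by-n cell array
    fprintf('Cluster %d (center %.2f%%): %s\n', k, centers(k), ...
        strjoin(members, ', '));
end
```

If each sector appeared in many rows and the label itself were meant to influence the grouping, dummyvar(T.Industry) could turn the categorical variable into indicator columns to place alongside the returns. With one row per sector, as here, those columns would not add any information.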