Splitting data set using information gain
조회 수: 1 (최근 30일)
이전 댓글 표시
How find maximunm information gain and using this maximum information gain how can I split each row of my data set.
댓글 수: 2
KSSV
2019년 11월 8일
Can you elaborate? What is maximum information gain? What exactly you want to do with the data?
답변 (1개)
Shishir Singhal
2020년 7월 28일
Hi,
I seems like you want to split your data into two sets in the basis of information gain.
Please refer to the file exchange link: https://in.mathworks.com/matlabcentral/fileexchange/14996-entropy
I only need to decide which column of your dataset is represented as a "class" i,e. target variable and which columns are represented as features. Use function "entropyF" to calculate the entropy of each feature variable with respect to "class" variable. Function "getBestEnt" return the index of the feature having highest information gain.
Hope it helps. !!!
Thanks
댓글 수: 0
참고 항목
카테고리
Help Center 및 File Exchange에서 Spreadsheets에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!