How to partition a table or matrix of observation data so that samples from every group are included?

조회 수: 4 (최근 30일)
I have a table of categorized data with 41 categories. Every category has at least 5 samples. If I partition it into a 90:10 training:test split like so:
cvp = cvpartition(data.Category,'Holdout',0.1);
it'll split it into the quantities I want but one group will frequently not contain samples from all 41 categories. How do I split it so that all categories are represented in both the training and test datasets? It's in Table format at the moment but can switch to cell array if needed.

답변 (1개)

Sachin Meena
Sachin Meena 2018년 9월 21일
Try the
cvpartition(group,'HoldOut',p,'Stratify',stratifyOption)
option, refer to cvpartition documentation for examples. You may not need to use tall arrays.

제품


릴리스

R2018a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by