How to Split test and training set

조회 수: 22 (최근 30일)
UTHOWAIPRU CHOWDHURY BAICHING
UTHOWAIPRU CHOWDHURY BAICHING 2022년 6월 22일
I am stuck on how split train and test data correctly.
XTrain = readtable("urbanGB.csv");
YTrain = readtable("urbanGB.labels.csv");

채택된 답변

Garmit Pant
Garmit Pant 2022년 6월 23일
Hello Uthowaipru
It is my understanding that you have data in the form of CSV files that you wish to load and split into train and test subsets.
You can use the 'cvpartition' function of the Statistics and Machine Learning Toolbox to do the same.
You'll can use the following code snippet to divide your data into training and test subsets.
XTrain = readtable("urbanGB.csv");
YTrain = readtable("urbanGB.labels.csv");
XTrain.Y = table2cell(YTrain); %assuming YTrain to be a single column
rng('default') % For reproducibility
n = length(XTrain.Y);
hpartition = cvpartition(n,'Holdout',0.3); % Nonstratified partition
idxTrain = training(hpartition);
tblTrain = tbl(idxTrain,:);
idxTest = test(hpartition);
tblTest = tbl(idxNew,:);
You can refer to the documentation of cvpartition for more details: cvpartition

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 Model Building and Assessment에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by