필터 지우기
필터 지우기

TabularTextDataStore wont store more than 20000 rows

조회 수: 2 (최근 30일)
SkarletSam
SkarletSam 2020년 10월 21일
댓글: Walter Roberson 2020년 10월 21일
Im trying to import a large csv file (60,000 x 171). tabularTextDataStore is only storing up to 20,000 rows. Is there a more efficient way to import all of the data?

채택된 답변

Kevin Gurney
Kevin Gurney 2020년 10월 21일
The amount of data returned on each call to the read method of a TabularTextDatastore is controlled by the value of the ReadSize property.
If you want to read more than 20,000 rows on each call to read (20,000 is the default value), then you can adjust the value of ReadSize like so:
trainSet = tabularTextDatastore("data\trainingSet.csv");
trainSet.ReadSize = 60000; % You can adjust this value to your liking.
trainSet.TreatAsMissing = 'na';
trainSetR = read(trainSet)
  댓글 수: 2
SkarletSam
SkarletSam 2020년 10월 21일
It seems like TabularTextDatastore's read size has a max value of 45140
Walter Roberson
Walter Roberson 2020년 10월 21일
There seems to be an internal BUFFER_UPPERLIMIT property that is set to 32 megabytes, with 64 megabytes used for HADOOP . I do not see any method to set that property.

댓글을 달려면 로그인하십시오.

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 Logical에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by