Whitespace problem with textscan
조회 수: 16 (최근 30일)
이전 댓글 표시
While using textscan, it doesn't seem to treat multiple whitespace as a single delimiter. The file has whitespace and tab delimeters,
2012-10-15 K01 5.83 5.05 5.73 6.41 4.28
2012-10-15 K01 5.25 5.80 6.41 4.28
2012-10-15 K01 4.28
Using data = textscan(fd, form,'HeaderLines',2); so data{:,3} becomes
5.83, 5.25 and 4.28
but should be
5.83, NaN and NaN.
I've tried using the whitespace function but cannot get it to work
채택된 답변
Stephen23
2014년 10월 8일
편집: Stephen23
2014년 10월 8일
It does if you use the 'MultipleDelimsAsOne' option:
This is explained in the section "Treat Repeated Delimiters as One".
Although, given your example output, you probably want to use the 'EmptyValue' option, which is explained in the section "Specify Delimiter and Empty Value Conversion".
댓글 수: 8
Stephen23
2014년 10월 8일
This works correctly:
>> S = sprintf('anna\t123.45\t \t67.890\nbob\t \t \t9.7');
>> textscan(S,'%s%f%f%f','Whitespace','','TreatAsEmpty',' ','Delimiter','\t')
ans = {{'anna';'bob'},[1.2345;NaN],[NaN;NaN],[67.89;9.7]}
Try the above options with your formatSpec, and just one line, then try it with two lines...
Stephen23
2014년 10월 8일
The file format uses mixed delimiters (a space and tab), and also uses the space as a placeholder for missing data. I would suggest that the easiest way to deal with this would be to do some pre-processing:
- replace the space between the date and 'K01' with a tab. This would be easy to achieve using regexprep (eg regexprep(str,' K','\tK') ).
- replace the place-holder spaces with nothing... keep those fields empty! This might be the better option, as then you might be able to use (almost) the default settings for textscan.
추가 답변 (0개)
참고 항목
카테고리
Help Center 및 File Exchange에서 Data Type Conversion에 대해 자세히 알아보기
제품
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!