How do I read data (from a .dat file) seperated by lines of text into individual vectors
조회 수: 6 (최근 30일)
이전 댓글 표시
Hi there,
I am struggling to read in a .dat file correctly. The file has a bunch of headers starting with #s at the beginning followed by data (2 columns) followed by another header and then more data etc. The data I wish to read into matlab is over 2 million lines so you can imagine that having to go back and delete these headers to load chunks of data in individually will take some time.
Is there a way for Matlab to read in a file like this and seperate the data into seperate vectors each time there's a line of text/ a gap (if I have to go through and delete all the headers). It will not 'load' the data as it contains text that over runs the amount of columns of data at the start. I have tried using the import tool in matlab and I have tried fscanf but not had much luck either.
Any advice would be much much appreciated.
Many thanks
댓글 수: 0
채택된 답변
Walter Roberson
2019년 3월 2일
편집: Walter Roberson
2019년 3월 2일
S = regexp(fileread('YourFileName.dat'), '\r?\n', 'split');
if isempty(S{end}); S(end) = []; end %regexp split leaves empty at bottom if file ended in \n which is common
nonheader = cellfun(@isempty, regexp(S, '^\s*#')); %permit space before #
starts = strfind([false nonheader], [false true]);
stops = strfind([nonheader false], [true false]);
num_blocks = length(starts);
blocks = cell(num_blocks, 1);
for K = 1 : num_blocks
fields = regexp(S(starts(K):stops(K)), '\s+', 'split');
lens = cellfun(@length, fields);
if any(lens ~= 2)
error('data that does not have exactly two fields is present between lines %d and %d', starts(K), stops(K));
end
temp = vertcat(fields{:});
blocks{K} = str2double(temp);
end
end
Now blocks will be a cell array consisting of numeric arrays with two columns each.
댓글 수: 3
Walter Roberson
2019년 3월 2일
hold on
cellfun(@(B) plot(B(:,1), B(:,2)), blocks);
hold off
You can omit the hold commands if hold is already in effect. However, you need hold to be in effect when the second plot() is executed, or else each plot() will end up removing the previous lines.
You could potentially have problems with xlim and ylim not ending up set nicely.
If you want to loop through the blocks then,
for K = 1 : length(blocks)
x = blocks{K}(:,1);
y = blocks{K}(:,2);
..... do things
end
추가 답변 (0개)
참고 항목
카테고리
Help Center 및 File Exchange에서 Text Files에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!