Reading binary files consisting of different data types without a for loop.

조회 수: 13 (최근 30일)

Adam 2014년 9월 15일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/154920-reading-binary-files-consisting-of-different-data-types-without-a-for-loop

편집: Mark Thomson 2022년 5월 2일

I have a binary data file that consists of some M data sets. Each set of data is made up of Nbytes of a specific template, e.g. [uint16, uint16, uint16, uint32, double, uint32, int16]. Right now I'm just looping over how many data sets I have and reading the information in each data set according to it's type.

for j = 1:Mdatasets
     this(j) = fread(fid,1,'uint32');
     foo(j) = fread(fid,1,'uint16');
     foofoo(j) = fread(fid,1,'double');
     % and so on...
end

Is there a faster way to do this? It can't take a very long time to read some of my larger files (~500MB). I was thinking that if you could give fread() a data type template to repeat over and over like it can do with a single data type, that would be ideal. Not sure if there is a way to do this, or if someone has a way around it, but for loops take so long.

Best Regards,

Adam

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Image Analyst 2014년 9월 15일

The for loop is definitely NOT the problem You can do tens of millions of iterations in less than a second and I'm sure you don't have that many files. The time is being taken up by the disk I/O rather than the for loop.

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

답변 (2개)

Guillaume 2014년 9월 15일

2
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/154920-reading-binary-files-consisting-of-different-data-types-without-a-for-loop#answer_151787

편집: Guillaume 2014년 9월 17일

MATLAB Online에서 열기

Use the skip argument of fread to read all the elements of the same type at once. From my reading of fread doc, skip is not very straightforward to calculate. This may work:

fieldsizes = [4 2 8 ...]; %uint32 uint16 double ...
skip = @(n) sum([fieldsize(1:n+1) fieldsize(n+1:end)]); %sum up of field sizes except field n
offset = @(n) sum(fieldsizes(1:n)); %offset to element n+1
this = fread(fid, Mdatasets, 'uint32', skip(1));
fseek(fid, offset(1), -1);
foo = fread(fid, Mdatasets, 'uint16', skip(2));
fseek(fid, offset(2), -1);
foofoo = fread(fid, Mdatasets, 'double', skip(3));
fseek(fid, offset(3), -1);
...

edit: added fseek as per Michael comment.

댓글 수: 6
이전 댓글 4개 표시이전 댓글 4개 숨기기

Guillaume 2014년 9월 17일

Thanks Michael for the reminder about the file pointer, I'd completely missed that. It's actually fseek that's needed since you need to go back to the right element.

Reading the file my or Iain's way is bound to be faster than reading it one element at a time.

A third option, probably the fastest is to write a mex file.

Mark Thomson 2022년 5월 2일

편집: Mark Thomson 2022년 5월 2일

Many thanks Guillaume for pointing out the Skip functionality (I should have read the fread help in full first)... I had a similar issue, only with Fortran UNFORMATTED files where this helped skip the record entries.

It seems that the overheads in a for loop are indeed due to the repetitive file access, not being streamlined by the functionality of the compiled built-in fread with Skip.

댓글을 달려면 로그인하십시오.

Image Analyst 2014년 9월 15일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/154920-reading-binary-files-consisting-of-different-data-types-without-a-for-loop#answer_151786

MATLAB Online에서 열기

You're reading 1 byte at a time - no wonder it takes so long. Read in a whole image at a time:

thisImage = fread(fid, [rows, columns], '*uint16');

댓글 수: 5
이전 댓글 3개 표시이전 댓글 3개 숨기기

Image Analyst 2014년 9월 16일

If you have patterns of data in one file, like a bunch of small variables (header info) and then maybe a big image, then you could make a subroutine to do the "common" part. It could take in a page or slice number and use fseek to go to the starting point for that page/slice. I've written readers for custom image formats, like CT data, and I could give you examples if you want. I read in the header and image data.

Adam 2014년 9월 16일

Right, so you're suggesting skipping around the binary file and reading the 'common' data types like Guillaume suggested.

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

카테고리

MATLAB Graphics Images Convert Image Type

Help Center 및 File Exchange에서 Convert Image Type에 대해 자세히 알아보기

제품

MATLAB

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by