How to efficiently integrate big data without using memory / (How to create big data)
이전 댓글 표시
- in a study i will produce large arrays.
- Each array will have at least 500 MB size.
- Each array will have the same number of rows.
- the total size of dataset will be approximately 20 GB or over.
- Somehow I have to create a single variable/array which includes all data and size of 20 GB.
matfile seems a good solution. However when the size of file increases, it gets slower. How can i handle this problem?
댓글 수: 9
blaat
2015년 8월 18일
How you store and access big data is strongly dependent on what you need to do with it. Do you need all arrays at the same time? Do you need specific values from these arrays at the same time? Can you somehow partition your calculations to a subset of the data?
Without more information it is very difficult to give advice on your problem.
Mehmet OZC
2015년 8월 18일
Mehmet OZC
2015년 8월 18일
blaat
2015년 8월 18일
If can process the arrays separately, perhaps it would be more convenient to keep them as separated files. Or is there another reason you want a single, large file?
If a single file is required, I would advise against storing everything in a single variable. As far as I know, there is no way of reading only part of an array from a .mat-file, so the file will require 20 GB of memory to load.
Steven Lord
2015년 8월 18일
Why do you need to write them to a single file? Why not put each in its own file; that way if something were to happen to one of the files you wouldn't lose all of your data?
Mehmet OZC
2015년 8월 18일
편집: Mehmet OZC
2015년 8월 18일
blaat
2015년 8월 18일
So, if I understand correctly, your problem is this: you want to write 20 GB of data to a single variable in a .mat-file, but it's getting unworkably slow? Or doesn't it work at all?
Mehmet OZC
2015년 8월 18일
편집: Mehmet OZC
2015년 8월 18일
Walter Roberson
2015년 8월 18일
I wonder if compression is leading to slowdowns? I do not know whether -v7.3 with matfile uses compression; see discussion http://www.mathworks.com/matlabcentral/answers/15521-matlab-function-save-and-v7-3 and http://www.mathworks.com/matlabcentral/answers/137592-compress-only-selected-variables-when-saving-to-mat
채택된 답변
추가 답변 (0개)
카테고리
도움말 센터 및 File Exchange에서 Standard File Formats에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!