Import large .par file
조회 수: 12 (최근 30일)
이전 댓글 표시
I need to import a .par file with 2.5gb of dimension. The .par file is a numerical table with 10 columns and few millions of rows.
I tried to used the parquetread function but i got an error message ("Unable to read Parquet"). A reduced version of the parquet file, 500mb, is uploaded successfully. My conclusion is that the original .par file is too large.
Any suggestion on how to import large .par file?
댓글 수: 0
답변 (1개)
Saffan
2023년 5월 29일
Hi Carlo,
One way to solve this issue could be by splitting the original parquet file into multiple smaller files and create “parquetDatastore” of them. The data can then be obtained from the datastore using the “read” function. A datastore is a repository of files where each individual file can fit in the memory but the entire collection of files may not fit. They are used to work with the data incrementally and hence reduces the memory usage.
You can refer to the following link for more information on datastores: https://www.mathworks.com/help/releases/R2022b/matlab/ref/datastore.html
댓글 수: 0
참고 항목
카테고리
Help Center 및 File Exchange에서 Datastore에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!