How do I import a file containing both numbers and strings

Question

0 votes

I've looked around and found multiple posts about this issue, but none of them really helped me with this problem.

I'm trying to import a text file containing a user-defined square matrix that holds numeric values and the string X, for example:

01 24.02 1.03 8.04 
06 5.07 7.08 14.09 
01 X 13.03 20.04 
06 12.07 19.08 21.09 
01 18.02 25.03 2.04 

I've tried using importdata but when it reaches the X, the rest of the line gets replaced by NaN and the lines underneath get ignored. This is what happens:

0100   24.0200    1.0300    8.0400   
0600    5.0700    7.0800   14.0900   
0100       NaN       NaN       NaN      

Since the matrix is supplied by the user, its size and the positions of the 'X' entries will differ every time. Knowing that the matrices are always square, what can I do to solve this?


Answer 1

Stephen23 on 22 Nov 2018

Edited: Stephen23 on 22 Nov 2018

2 votes

Use textscan and set its 'TreatAsEmpty' option to 'X'.

This will be much more efficient than importing and post-processing (e.g. with regexp).
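As a minimal sketch of that option, assuming a 3-by-3 matrix so the number of columns is known in advance, textscan can be run on a character vector instead of a file. The sample text and the variable names here are illustrative only and are not taken from the question's file:

```matlab
% Illustrative 3-by-3 input; the column count is assumed to be known here.
txt = sprintf('1 2 3\n4 X 6\n7 8 9');

% One %f per column; any field that is exactly 'X' is read as empty,
% which textscan stores as NaN in a numeric column.
C = textscan(txt, '%f%f%f', 'TreatAsEmpty', 'X', 'CollectOutput', true);

% With CollectOutput the three numeric columns are merged into one matrix.
A = C{1};
```

The same call works with a file identifier from fopen in place of txt.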

Comments: 3

Stephen23 on 22 Nov 2018

temp1.txt

"but how do you solve the part about not knowing the size of the matrix?"

Read the first line using fgetl, count the delimiters, then frewind back to start of the file. Use the count to define a format string using repmat, then read the file using textscan. It sounds complex, but it isn't really:

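% textscan options: space-delimited, read 'X' as NaN, merge columns into one matrix
opt = {'Delimiter',' ','TreatAsEmpty','X','CollectOutput',true};
[fid,msg] = fopen('temp1.txt','rt');
assert(fid>=3,msg) % fopen returns -1 on failure and msg explains why
% Count the delimiters in the first line to get the number of columns
cnt = nnz(strtrim(fgetl(fid))==' ');
frewind(fid) % go back to the start of the file
fmt = repmat('%f',1,1+cnt); % one %f per column
C = textscan(fid,fmt,opt{:});
fclose(fid);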

And checking (the test file is attached):

>> C{1}
ans =
0100   24.0200    1.0300    8.0400
0600    5.0700    7.0800   14.0900
0100       NaN   13.0300   20.0400
0600   12.0700   19.0800   21.0900
0100   18.0200   25.0300    2.0400

TADA on 22 Nov 2018

nice +1

still much faster


Answer 2

TADA on 22 Nov 2018

1 vote

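txt = fileread('blah.dat');
lines = strsplit(strtrim(txt), newline); % one cell per row; strtrim drops a trailing newline
x = regexp(lines, '\S+', 'match');       % whitespace-separated tokens of each row
items = cat(1,x{:});                     % N-by-N cell array of tokens
A = str2double(items);                   % non-numeric tokens such as X become NaN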

Each X is now represented by NaN in matrix A, because str2double returns NaN for text it cannot convert to a number.


Answer 3

dpb on 22 Nov 2018

0 votes

If you're going to read in text at random locations, then unless you can determine those locations or require the user to tell you where they are, the only alternative is to read the whole array as a cellstr array and then figure out after the fact "who's who in the zoo", i.e. which locations are and aren't numeric.
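One way to sketch the read-everything-as-text approach is shown below. It relies on the question's statement that the matrix is always square; the sample text and the variable names are illustrative assumptions, not part of the original answer:

```matlab
% Illustrative input; in practice txt could come from fileread.
txt = sprintf('1 2 3\n4 X 6\n7 8 9\n');

% Read every whitespace-delimited token as text (a column cellstr).
C = textscan(txt, '%s');
tok = C{1};

% The matrix is assumed square, so its side length is the square root
% of the token count.
n = sqrt(numel(tok));
assert(n == round(n), 'Token count is not a perfect square.');

% Tokens arrive row by row, while reshape fills column by column,
% so transpose after reshaping.
A = reshape(str2double(tok), n, n).';  % non-numeric tokens become NaN
isX = reshape(strcmp(tok, 'X'), n, n).';  % logical mask of the X positions
```

Keeping the mask isX separate from A makes it possible to tell a genuine X apart from any other token that failed to convert.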
