How to read in a text file that has two different formats.

조회 수: 3 (최근 30일)
Kevin
Kevin 2014년 8월 26일
답변: Kevin 2014년 8월 27일
How can I read in a text file that does not have a consistent format for every row of data?
Here is an exert of the file I need to read.
2014-08-05 08:56:09.936: From 10.10.30.3: bytes=60 seq=06bf ID=48f6 time=32.110ms
2014-08-05 08:56:10.435: From 10.10.30.3: bytes=60 seq=06c0 ID=48f7 time=32.160ms
2014-08-05 08:56:10.903: Timeout waiting for seq=06bc
2014-08-05 08:56:10.934: From 10.10.30.3: bytes=60 seq=06c1 ID=48f8 time=32.475ms
Notice that the third row is not like the others. The following code works when I don't have these occasional rows with the "Timeout" message.
fid=fopen('ping.txt')
PingFile=textscan(fid, '%s %s From%s bytes=%d seq=%s ID=%s time=%f %*[^\n]')
How do I capture the information in these occasional rows, and read in the entire file?
  댓글 수: 2
José-Luis
José-Luis 2014년 8월 26일
Will the message always be: "Timeout waiting..."?
Kevin
Kevin 2014년 8월 27일
Yes, the message will always be "Timeout waiting for seq=", followed by a hexadesimal number.

댓글을 달려면 로그인하십시오.

채택된 답변

Kevin
Kevin 2014년 8월 27일
Final conclustion based on the answers from dbp and Haderlein. Thank you both for your help.
AllData=textread('ping.txt','%s','delimiter','\n');
IsPing=cellfun(@isempty,strfind(AllData,'Timeout'));
PingData =AllData(IsPing);
TimeOut = AllData(~IsPing);
PDfn=@(s) textscan(s, '%s %s From%s bytes=%d seq=%s ID=%s time=%fms');
TOfn=@(s) textscan(s, '%s %s Timeout waiting for seq=%s');
PingCell=cellfun(PDfn,PingData,'uniformoutput',false);
ToutCell=cellfun(TOfn,TimeOut,'uniformoutput',false);

추가 답변 (1개)

dpb
dpb 2014년 8월 26일
I'd do it a little differently...bring the file into memory and eliminate the bum lines, then scan from memory.
>> file=textread('kevin.txt','%s','delimiter','\n'); % textread is handy sometimes, too...
>> file=file(cellfun(@isempty,strfind(file,'Timeout'))) % get rid of timeout lines
file =
'2014-08-05 08:56:09.936: From 10.10.30.3: bytes=60 seq=06bf ID=48f6 time=32.110ms'
'2014-08-05 08:56:10.435: From 10.10.30.3: bytes=60 seq=06c0 ID=48f7 time=32.160ms'
'2014-08-05 08:56:10.934: From 10.10.30.3: bytes=60 seq=06c1 ID=48f8 time=32.475ms'
Nos create a function handle for the scanning...
>> fn=@(s) textscan(s, '%s %s From%s bytes=%d seq=%s ID=%s time=%fms')
fn =
@(s)textscan(s,'%s %s From%s bytes=%d seq=%s ID=%s time=%fms')
>> cellfun(fn,file,'uniformoutput',false)
ans =
{1x7 cell}
{1x7 cell}
{1x7 cell}
  댓글 수: 2
Kevin
Kevin 2014년 8월 27일
Thank you dpb. This works very well. I do, however, want to retain the information in the "Timeout" rows. I don't know anything about using the "@" simble, so I'm not sure how to modify your code.
Michael Haderlein
Michael Haderlein 2014년 8월 27일
Just modify the following way:
isdata=cellfun(@isempty,strfind(file,'Timeout'));
data=file(isdata);
timeout=file(~isdata);
Then apply the fn function on data instead of file. The timeout information can be read out by similar means.

댓글을 달려면 로그인하십시오.

카테고리

Help CenterFile Exchange에서 Text Files에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by