Extracting parts of a string
I have a text file with information like this:
FileName; SampleFreq; Test;Modality;Channel;Description;StimIntensity; Position; RecordingTime
C:\Users\G10040419\Desktop\lp export application\Data 139\00000090_1.WAV; 22000; 2;1;1;5 CH Right; 0.00; -10000; 40147.491374
I need to extract the SampleFreq (22000) and the Position (-10000). I tried to use regular expressions, but I cannot find a specific delimiter for these data.
Accepted Answer
Paolo
15 Jun 2018
Edited: Paolo
15 Jun 2018
The following code uses regexp to extract the data you want.
data = fileread('00000090Head.txt');
expression = '(?<=WAV;\s*)(\d*)(?:;\s*\d*;\d;\d;(.*?(?=;));\s*\d*\.\d*;\s*)(-?\d*)';
[tokens,match] = regexp(data,expression,'tokens','match');
sampleFrequency = cellfun(@(x) x(1,1),tokens);
position = cellfun(@(x) x(1,2),tokens);
position and sampleFrequency are both 1x183 cell arrays of character vectors and contain the data you are interested in.
position = {'-10000' '-9000' '-8000' '-7000' '-6000' '-5000' '-4500' '-4000' '-3500' '-3000' ................}
sampleFrequency = {'22000' '22000' '22000' '22000' '22000' '22000' '22000' '22000' '22000' '22000' .................}
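Since regexp returns the tokens as text, a conversion step is needed before doing arithmetic on them. A minimal sketch, assuming the cell arrays look like the ones shown above (only the first three entries are reproduced here; fs and pos are illustrative names, not from the original code):

```matlab
% First three entries copied from the output above (the real arrays have 183).
sampleFrequency = {'22000', '22000', '22000'};
position        = {'-10000', '-9000', '-8000'};

% str2double accepts a cell array of character vectors and returns a
% double array of the same size (NaN for entries that are not numbers).
fs  = str2double(sampleFrequency);
pos = str2double(position);
```

After this, fs and pos are ordinary 1x3 double vectors.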
More Answers (3)
per isakson
15 Jun 2018
Edited: per isakson
15 Jun 2018
Is this what you are looking for?
fid = fopen( '00000090Head.txt', 'r' );
cac = textscan( fid, '%*s%f%*f%*f%*f%*s%*f%f%*f', 'Headerlines',1,'Delimiter',';' );
fclose( fid );
and inspect the result
>> cac
cac =
1×2 cell array
{183×1 double} {183×1 double}
>> cac{2}(1:3)
ans =
-10000
-9000
-8000
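To try the same format string without the file at hand, textscan can also be given a character vector instead of a file identifier. A small sketch, assuming the file consists of the header line plus rows like the single one quoted in the question (the variable txt is only for illustration):

```matlab
% Header and one data row, copied from the question.
txt = sprintf('%s\n%s', ...
    'FileName; SampleFreq; Test;Modality;Channel;Description;StimIntensity; Position; RecordingTime', ...
    'C:\Users\G10040419\Desktop\lp export application\Data 139\00000090_1.WAV; 22000; 2;1;1;5 CH Right; 0.00; -10000; 40147.491374');

% Same format as above: %*s and %*f skip a column, %f keeps it as double.
cac = textscan(txt, '%*s%f%*f%*f%*f%*s%*f%f%*f', ...
    'HeaderLines', 1, 'Delimiter', ';');

sampleFreq = cac{1};   % column 2 of the file
position   = cac{2};   % column 8 of the file
```

Only the two unstarred %f fields produce output, which is why cac has exactly two cells.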
Ana Maria Alzate
15 Jun 2018
Comments: 5
Jan
15 Jun 2018
@Ana Maria Alzate: Please do not post comments in the section for answers in the future. There is a section for comments for this job. Thanks.
Stephen23
18 Jun 2018
Edited: Stephen23
18 Jun 2018
Importing the data as strings and then parsing them with regular expressions is inefficient and also unnecessary: the file is very nicely formatted in delimited columns, so the required data can easily and efficiently be read directly as numeric (or char). The command textscan makes it easy to specify how to read those columns, and the format string is much simpler and more intuitive than those regular expressions:
>> fmt = '%*s%f%*d%*d%*d%*s%*f%f%*f';
>> opt = {'HeaderLines',1,'Delimiter',';'};
>> [fid,msg] = fopen('00000090Head.txt','rt');
>> assert(fid>=3,msg)
>> C = textscan(fid,fmt,opt{:});
>> fclose(fid);
>> [C{:}]
ans =
22000 -10000
22000 -9000
22000 -8000
22000 -7000
22000 -6000
22000 -5000
22000 -4500
22000 -4000
22000 -3500
22000 -3000
22000 -3000
22000 -2500
22000 -2000
22000 -1500
22000 -1000
22000 -500
22000 0
22000 500
22000 1000
22000 1500
... lots of lines here
22000 -3000
22000 -2500
22000 -2000
22000 -1500
22000 -1000
22000 -500
22000 0
22000 500
22000 1000
22000 1500
22000 2000
22000 2500
22000 3000
22000 3500
22000 4000
22000 4000
22000 5000
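To see why that format string picks out exactly those two columns, it can help to line up each conversion specifier with the header names from the question. This is only an illustrative sketch (names, specs and kept are hypothetical helper variables, not part of the answer's code):

```matlab
% Header line copied from the question, split into trimmed column names.
header = 'FileName; SampleFreq; Test;Modality;Channel;Description;StimIntensity; Position; RecordingTime';
names  = strtrim(strsplit(header, ';'));

% One specifier per column; concatenated, they give the fmt used above.
specs = {'%*s','%f','%*d','%*d','%*d','%*s','%*f','%f','%*f'};
fmt   = [specs{:}];

% A '*' tells textscan to skip the field, so the kept columns are the rest.
isSkipped = cellfun(@(s) any(s == '*'), specs);
kept      = names(~isSkipped);
```

Here kept holds the names of the columns that textscan actually returns, in order.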