Finding a string in a file

조회 수: 6 (최근 30일)
osminbas
osminbas 2011년 7월 22일
Hello,
I need help with this problem:
How can I find the files with the same name in different folders that have the number "5800" in them? Furthermore, how can I list these files/folders and which line(s) the number "5800" appear in the files?
I would really appreciate the help.
Thanks.
  댓글 수: 2
Fangjun Jiang
Fangjun Jiang 2011년 7월 22일
"5800" is in the folder name or in the file name? Is the file a ASCII text file?
Walter Roberson
Walter Roberson 2011년 7월 23일
"5800" is a target string to be found amongst several files.

댓글을 달려면 로그인하십시오.

답변 (8개)

Walter Roberson
Walter Roberson 2011년 7월 23일
CommonFileName = 'x3175.txt'; %or whatever the common name is
folderinfo = dir('*');
folderinfo(~[folderinfo.isdir]) = [];
folderinfo(strcmp({folderinfo.name},{'.','..'})) = [];
for FoIdx = 1 : length(folderinfo)
specificname = fullfile(folderinfo(FoIdx}.name, CommonFileName);
if exists(specificname, 'file)
%at this point, insert your code to examine specificfile
end
end

Jason Ross
Jason Ross 2012년 9월 21일
I realize this question is old, but in Windows Explorer (and not strictly a MATLAB question, either), you can put "content: " in the search box in the upper right hand corner, and it will search the files (including .doc files) for the string. So for this question, putting "content: 5800" would have searched and returned a list of the files that had "5800" in it.
Also, note that you can try finding something in Explorer, and then at the bottom it says "Search again in:", and one of the choices is "content". Not the most usable thing (the search dog in previous Windows versions exposed this more quickly), but it is there.
There is also a "find" command you can run from the Command Shell, just cd to the directory in question and type
find "5800" *
And the current directory will be searched. I checked a directory where I had some Word documents and it successfully searched them.

Honglei Chen
Honglei Chen 2011년 7월 22일
Hi osminbas,
You could write a script to achieve this. You can use cd to change directories, what to list all the files in the folder, then for each file, use textscan to read each line and strfind to find '5800'. You then just write the result to either the screen or a file.
HTH
Honglei

Walter Roberson
Walter Roberson 2011년 7월 23일
On unix systems:
!grep -Hn 5800 */TheFileName.txt

Fangjun Jiang
Fangjun Jiang 2011년 7월 23일
How about this? Open M-Editor, select menu Edit->Find file. Specify what text to find, in what files and what folder/sub-folder to find, the result will show all the files, folders and the lines that the text appears in.
Many other IDEs probably have the same capability too.

osminbas
osminbas 2011년 7월 25일
Thank you all. I realized that I misworded my question. I am trying to find the string "5800" in the file itself (it is a .doc file), not in the name of the file. Again, I appreciate your help.
  댓글 수: 6
Walter Roberson
Walter Roberson 2011년 7월 28일
The line "if exists(specificname, 'file)" that I put in the code checks to see whether the file exists in the subdirectory you are processing, and skips the string search if the file is not there.
For searching within a specific file, there are many ways. One way is:
fid = fopen(specificfile, 'rt');
lines = textscan(fid,'%[^\n]'); %reads line by line
fclose(fid);
L = find(~cellfun(@isempty,strfind(lines, '5800')),1,'first');
if ~isempty(L)
fprintf('found in %s at line %d\n', specificfile, L);
end
K E
K E 2012년 9월 21일
I just used this code in another project, so thanks Walter

댓글을 달려면 로그인하십시오.


osminbas
osminbas 2011년 7월 28일
Thank you all. Especially, thank you, Walter. In your code, when I define the folderinfo, it is the "big" folder that has all the subfolders in it. Some of these subfolders have the file I am looking for and some of them don't. How can I get only the ones with that file in them. And furthermore, how can I list these folder names in an excel sheet or text file (whichever is easier)?
Also, how do I search for the string "5800" inside the document? I know that you talked about the difficulties of reading inside .doc files but maybe you can answer my question assuming it is a txt file.
I really appreciate your help.

venkat vasu
venkat vasu 2012년 9월 22일
편집: Walter Roberson 2012년 9월 22일
a1=dir;
l=length(a1);
for i1=3:l
files=dir(a1(i1).name);
nfiles = length(files);
for i=3:nfiles
currentfilename = files(i).name;
if currentfilename==5800
%whatever operation
end
end
end
this code surely will help you...
  댓글 수: 1
Walter Roberson
Walter Roberson 2012년 9월 22일
The task was to search the content, not the file name.
Also, "currentfilename" from files(i).name will be a string, but you attempt to compare the string to the numeric value 5800 . That is not going to have the result you expect.

댓글을 달려면 로그인하십시오.

카테고리

Help CenterFile Exchange에서 Search Path에 대해 자세히 알아보기

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by