data extract

Question

0 개 추천

I have data in a single column in the following format:

123456-123456.123.abcde

I would like to extract 123456 between - and .

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

Answer 1

Fangjun Jiang 2011년 11월 9일

MATLAB Online에서 열기

0 개 추천

str='123456-123456.123.abcde';
num=regexp(str,'-[^\.]*','match');
num=str2double(num{1}(2:end))

Update

a=dir('*.bin');
b={a.name};
c=regexp(b,'-[^\.]*','match');
d=-cellfun(@str2double,c)
d =
      200000      200001      200002

댓글 수: 6
이전 댓글 4개 표시 이전 댓글 4개 숨기기

Fangjun Jiang 2011년 11월 9일

By the way, when I say valid data in MATLAB, I mean you write down something in your question so others can copy and paste to test it in the code. The three lines in your comment are not really valid data in MATLAB. You could provide it as str={'123456-200000.123.bin';'123456-200001.153.bin';'123456-200002.126.bin'}. So when others copy it, they have the data right away in MATLAB to work with.

Baba 2011년 11월 9일

Alright, no problem.

댓글을 달려면 로그인하십시오.

Answer 2

Walter Roberson 2011년 11월 9일

MATLAB Online에서 열기

1 개 추천

t = regexp(str, '-(\d+)', 'tokens');
str2double(t{1}{1})

댓글 수: 4
이전 댓글 2개 표시 이전 댓글 2개 숨기기

Walter Roberson 2011년 11월 9일

It is not very different from Fangjun's version, but involves fewer operations. regexp looks through each of the input strings, looking for a pattern of interest. The pattern of interest starts with a "-" and ends just before the first non-digit after that. The () indicate that whatever pattern inside the () is matched is to be recorded separately, so since the pattern is "one or more digits", those digits are recorded separately (i.e., without the leading "-" that was part of the matching pattern.) The 'tokens' parameter says to return the parts that were recorded separately (the "tokens" that the pattern marked as being of interest.)

The list of tokens is returned all in one cell array, and inside the cell array is a list of cell arrays, one per input string; inside there is the character array. The cellfun iterates over all of individual outputs (one per input line) and unwraps a cell array level from what is there and converts the result from text to a double precision number.

Baba 2011년 11월 9일

Thanks for the explanation.

댓글을 달려면 로그인하십시오.

data extract

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 6
이전 댓글 4개 표시 이전 댓글 4개 숨기기

추가 답변 (1개)

댓글 수: 4
이전 댓글 2개 표시 이전 댓글 2개 숨기기

카테고리

태그

Community Treasure Hunt

data extract

댓글 수: 0 이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 6 이전 댓글 4개 표시 이전 댓글 4개 숨기기

추가 답변 (1개)

댓글 수: 4 이전 댓글 2개 표시 이전 댓글 2개 숨기기

카테고리

태그

참고 항목

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글 수: 6
이전 댓글 4개 표시 이전 댓글 4개 숨기기

댓글 수: 4
이전 댓글 2개 표시 이전 댓글 2개 숨기기