Extracting consecutive digits using regexp

조회 수: 17 (최근 30일)
Hau Kit Yong
Hau Kit Yong 2019년 6월 22일
댓글: per isakson 2019년 6월 22일
I expected
regexp('ITEM 123', '.+(\d+)', 'tokens')
to return '123'. Why does it only return '3'? What would be the correct expression?

채택된 답변

per isakson
per isakson 2019년 6월 22일
편집: per isakson 2019년 6월 22일
These two returns "123"
%%
cac = regexp('ITEM 123', '.+?(\d+)', 'tokens' )
%%
cac = regexp('ITEM 123', '[^\d]+(\d+)', 'tokens' )
First, '.+' matches anything up til the end of the text, next it gives back just as little as needed to match '(\d+)' , which is one digit.
'.+?' matches as little as needed so that '(\d+)' is able to match the following text.
I prefer '[^\d]+(\d+)'
Or why not just
cac = regexp('ITEM 123', '\d+', 'match' )
  댓글 수: 2
Hau Kit Yong
Hau Kit Yong 2019년 6월 22일
Many thanks! For the first expression, what does the '?' character do? I've only seen it in lookaround operations, but always in the form of '?=', '?<=' etc. and never by itself.
per isakson
per isakson 2019년 6월 22일
Search Quantifiers and Lazy on the page Regular Expressions

댓글을 달려면 로그인하십시오.

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 Programming에 대해 자세히 알아보기

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by