Hi!
I have a cell array b (attached); in each cell of b I have an expression like this: 'Weather booming Chilli Relax https://t.co/pwp00Ndw3d' or expressions with @,#,$. I want to delete from these expressions all the characters like @,#,$ and the links like https://t.co/pwp00Ndw3d.
Example: if I have 'Weather booming @Chilli Relax# https://t.co/pwp00Ndw3d', I will want it becames 'Weather booming Chill Relax'
Can you help me? thanks

댓글 수: 3

Jan
Jan 2017년 6월 21일
Fine. What is your question?
elisa ewin
elisa ewin 2017년 6월 22일
sorry, now I re-write the question
Jan
Jan 2017년 6월 22일
편집: Jan 2017년 6월 22일
Weather booming Chilli Relax https://t.co/pwp00Ndw3d
This looks strange. It reminds me to Google: Britney Spears Instagram account used by hackers.
Perhaps I'm too distrustful, but I've modified the URL slightly to be sure. This does not change the core of the question or the answer. Sorry, these are hard times in the world wide web. Please do not take this personally.

댓글을 달려면 로그인하십시오.

 채택된 답변

Andrei Bobrov
Andrei Bobrov 2017년 6월 22일

2 개 추천

regexprep(b,'[$#@]|\<https:/+\S*\>','')

댓글 수: 8

elisa ewin
elisa ewin 2017년 6월 22일
if I want delete all special characters and not only (@,#,$), how can I modify this expression?
Jan
Jan 2017년 6월 22일
@Andrei: [$#@]|\<https:/+\S*\> ??? Looks like a rocket. +1
Stephen23
Stephen23 2017년 6월 22일
@elisa ewin: what is a "special character" ?
elisa ewin
elisa ewin 2017년 6월 22일
all characters that are not letters or numbers: I know them like special characters
Stephen23
Stephen23 2017년 6월 22일
편집: Stephen23 2017년 6월 22일
@elisa ewin: Your explanation contradicts your examples: if "special characters" are "not letters or numbers", then why do your examples still contain space characters in the output? Following your definition of "special characters" the space characters should have been removed as well: "I want to delete from these expressions all the characters like @,#,$ and...": clearly your original question and your definition of "special characters" requires no space characters in the output.
So, is a space character special? What about a comma? What about a newline? Is a non-breaking space special? What about an underscore? Is a tab character special?
Jan
Jan 2017년 6월 22일
@Elisa: Do you mean:
  • split string at spaces to words
  • delete all words starting with 'https://' or all words containing '/'
  • remove all special characters identified by: ~isstrprop(S, 'alphanum')
elisa ewin
elisa ewin 2017년 6월 22일
yes
Andrei Bobrov
Andrei Bobrov 2017년 6월 22일
편집: Andrei Bobrov 2017년 6월 22일
Hi Jan! Yes! "Russian rocket". :)
regexprep(b,'\<[^A-Za-z \?\,]|https:/+\S*\>','')

댓글을 달려면 로그인하십시오.

추가 답변 (1개)

Jan
Jan 2017년 6월 22일
편집: Jan 2017년 6월 22일

0 개 추천

S = 'Weather booming Chilli Relax https://t.co/pwp00Ndw3d';
C = strsplit(S, ' ');
C(contains(C, '/')) = []; % Or how you identify a link
for iC = 1:numel(C)
aC = C{iC};
C{iC} = aC(isstrprop(aC, 'alphanum'));
end
Result = sprintf('%s ', C{:});
Result(end) = [];
The command contains was introduced in R2016b. If you have an older version, use:
function Tf = contains(C, Patterm)
Tf = ~cellfun('isempty', strfind(C, Pattern));
end

카테고리

질문:

2017년 6월 21일

편집:

2017년 6월 22일

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by