the number of occurences of each character of one string,in another

Question

hiva 2014년 12월 28일

1
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another

편집: Luuk van Oosten 2015년 1월 24일

i have a string of more than 100 characters (fasta format of a protein sequence. like

'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'

which is being shortened here for simplicity) and i want to find out whether or not it is hydrophobic. so i have to check the number of occurrences of each of the characters in the set 'A C F I L M P V W Y'(hydrophob amino acids) in my fasta string. considering the very long length of fasta strings, is there any easy way to do that by matlab string functions?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Azzi Abdelmalek 2014년 12월 28일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163456

편집: Azzi Abdelmalek 2014년 12월 28일

MATLAB Online에서 열기

str='MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'
p={'A' 'C' 'F' 'I' 'L' 'M' 'P' 'V' 'W' 'Y'}'
out=[p cellfun(@(x) nnz(ismember(str,x)),p,'un',0)]

댓글 수: 2
없음 표시없음 숨기기

hiva 2014년 12월 29일

thanks a lot.i guess this works well for a lot of similar cases that are supposed to work the same way in my code(since it is feature extraction and there are lots of features). also tells me how much i don't know from matlab.thanks.

Stephen23 2014년 12월 30일

편집: Stephen23 2014년 12월 30일

MATLAB Online에서 열기

This could be simplified and speeded-up by using arrayfun instead of cellfun, and removing the ismember:

>> t = 'ACFILMPVWY';
>> arrayfun(@(x)sum(str==x), t)
ans =
     6     2     4     6    13     2     7     7     1     7

댓글을 달려면 로그인하십시오.

Answer 2

Peter Perkins 2014년 12월 29일

2
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163537

MATLAB Online에서 열기

Another possibility:

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> n = hist(double(s),1:90);
>> n(t)
ans =
     6     2     4     6    13     2     7     7     1     7

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Jan 2014년 12월 30일

This is a histogram problem, so histc is an efficient and direct solution.

댓글을 달려면 로그인하십시오.

Answer 3

Luuk van Oosten 2015년 1월 24일

2
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_165835

편집: Luuk van Oosten 2015년 1월 24일

MATLAB Online에서 열기

I reckon you are using the BioInformatics Toolbox. In that case you can probably use:

aacount('SEQ')

Where SEQ is of course your sequence of interest: MEQNGLDHDSRSSIDTTINDTQKTFLEF....

and using

nr_A = All.A
nr_C = All.C
nr_F = All.F

etc. (you get the idea)

you get the numbers of your hydrophobic residues. Sum these and you have your hydrophobic score. You might want to 'normalize' this number by dividing this number by the total amount of amino acids in the sequence.

Of course you can write a loop for this and calculate the hydrophobic score for all your sequences in your FASTA file.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Answer 4

Shoaibur Rahman 2014년 12월 28일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163455

MATLAB Online에서 열기

s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
numA = sum(s=='A')
numC = sum(s=='C')
numF = sum(s=='F')
numI = sum(s=='I')
numL = sum(s=='L')
numM = sum(s=='M')
numP = sum(s=='P')
numV = sum(s=='V')
numW = sum(s=='W')
numY = sum(s=='Y')

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

hiva 2014년 12월 29일

very simple and delicate. really thanks

댓글을 달려면 로그인하십시오.

Answer 5

Stephen23 2014년 12월 30일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163616

편집: Stephen23 2014년 12월 30일

MATLAB Online에서 열기

A neat solution using bsxfun :

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> sum(bsxfun(@eq,s.',t))
ans =
     6     2     4     6    13     2     7     7     1     7

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

hiva 2014년 12월 30일

편집: hiva 2014년 12월 30일

wow!!! just wonderful. it works pretty well.thanks a lot.

댓글을 달려면 로그인하십시오.

the number of occurences of each character of one string,in another

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 2
없음 표시없음 숨기기

추가 답변 (4개)

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

the number of occurences of each character of one string,in another

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 2 없음 표시없음 숨기기

추가 답변 (4개)

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 2
없음 표시없음 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기