Why are there more than 4 nucleotides?
Hi, as far as I know, there are only 4 nucleotide letters. Why does MATLAB use 17 letters, as in the table here:
Thanks
Accepted Answer
Walter Roberson
16 Nov 2011
The table there looks pretty straightforward to me: http://www.mathworks.com/help/toolbox/bioinfo/ref/int2nt.html#bp_rekb-1 . Beyond the four plain bases, it has codes for situations in which particular sets of nucleotides are known to be present or known to be absent.
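To make the "sets of nucleotides" idea concrete, here is a minimal Python sketch of the standard IUPAC ambiguity codes. It is not MATLAB code and does not reproduce the integer values in the int2nt table; the names `IUPAC` and `possible_bases` are made up for this illustration. The 4 plain bases plus 11 ambiguity symbols give 15 letters, and I am assuming the remaining entries in a 17-row table are a gap symbol and an "unknown" symbol.

```python
# Standard IUPAC nucleotide codes: each symbol stands for a set of bases.
# (Illustrative names; this is not how MATLAB stores the table.)
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",          # the 4 plain bases
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC",      # two-base ambiguities
    "S": "CG", "W": "AT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",  # "not A", "not C", ...
    "N": "ACGT",                                     # any base
}


def possible_bases(symbol):
    """Return the set of plain bases that an IUPAC symbol may represent."""
    return set(IUPAC[symbol.upper()])


if __name__ == "__main__":
    for symbol in "ARYBN":
        print(symbol, "->", sorted(possible_bases(symbol)))
```

So `R` means "A or G is present", while `B` means "A is known to be absent", which is the present/absent distinction mentioned above.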
Besides, the number of known nucleotides is not 4: it is currently 8. The 7th and 8th were announced in July 2011, with the 5th and 6th having been announced in April 2005.
More Answers (1)
Lucio Cetto
19 Nov 2011
Ambiguous nucleotide symbols are used to characterize sequences that can have variations. They were introduced in the 1980s and are still useful in certain cases, for example when describing restriction enzymes (e.g. http://www.chem.qmul.ac.uk/iubmb/misc/naseq.html). In my personal opinion, there are other situations in which we have better options, such as sequence motifs, sequence profiles and the more elaborate profile HMMs. If you plan to convert to amino acids (aa), MATLAB can also use ambiguous aa codes when possible, although this is no longer standard practice; most people now use only ACGT.
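As a rough illustration of the restriction enzyme case, the Python sketch below expands an ambiguous site pattern into a regular expression and scans a sequence for it. This is a standalone example, not a MATLAB or toolbox function; `iupac_to_regex` and `find_sites` are hypothetical names, and `GTYRAC` (the pattern commonly cited for HincII) and the test sequence are used only as examples.

```python
import re

# Standard IUPAC nucleotide codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC",
    "S": "CG", "W": "AT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def iupac_to_regex(pattern):
    """Turn an IUPAC pattern such as 'GTYRAC' into a regex string."""
    parts = []
    for symbol in pattern.upper():
        bases = IUPAC[symbol]
        # Plain bases stay literal; ambiguous ones become a character class.
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def find_sites(pattern, sequence):
    """Return 0-based start positions of every (possibly overlapping) match."""
    # The lookahead lets the scan report overlapping sites as well.
    regex = re.compile("(?=(" + iupac_to_regex(pattern) + "))")
    return [m.start() for m in regex.finditer(sequence.upper())]


if __name__ == "__main__":
    print(iupac_to_regex("GTYRAC"))
    print(find_sites("GTYRAC", "AAGTTAACCGTCGACTT"))
```

One six-letter pattern covers all four concrete sites (GTTAAC, GTTGAC, GTCAAC, GTCGAC), which is why the ambiguous alphabet remains handy for this kind of description.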