Understanding the bioinformatic tool
Dear Matlab Team, I am really enjoying Matlab and your internet help. I am currently working with http://www.mathworks.com/help/pdf_doc/bioinfo/bioinfo_ug.pdf
On page 2-77 I used two lines that I am not sure about:
fow_idx = find(~bitget(getFlag(bm1_filtered),5));
rev_idx = find(bitget(getFlag(bm1_filtered),5));
It says there: "get the indices for the forward and the reverse reads in each pair. This information is captured in the fifth bit of the flag field"
At the end of this page an example is shown (and I got the same output):
SRR054715.sra.6849385 163 20 60 40M AACCCTAAACCTCTGAATCCTTAATCCCTAAATCCCTAA
SRR054715.sra.6849385 83 229 60 40M CCTATTTCTTGTGGTTTTCTTTCCTTCACTTAGCTATGG
SRR054715.sra.6992346 99 20 60 40M AACCCTAAACCTCTGAATCCTTAATCCCTAAATCCCTAAA
SRR054715.sra.6992346 147 239 60 40M GTGGTTTTCTTTCCTTCACTTAGCTATGGATGGTTTAT
Looking at the flags, the forward indices include 163 and 99, and the reverse indices include 83 and 147.
According to http://picard.sourceforge.net/explain-flags.html, if I understand correctly, the forward reads should include 99 and 83, which are the first reads in their pairs, and not 163?
I think that the correct bit for the forward should be 7 or 8 and not 5?
Maybe I am misunderstanding something; I would be grateful if you could correct my way of thinking. Best, yishai
Accepted Answer
Joe Myint
27 Mar 2013
Edited: Joe Myint
27 Mar 2013
Hello Yishai:
The fifth bit of the SAM flag (value 16, or 0x10) indicates whether a read is mapped to the reverse strand or the forward strand. The bit is 1 if the read is mapped to the reverse strand, and 0 if it is mapped to the forward strand.
Getting back to the two commands that you weren’t sure about:
|fow_idx = find(~bitget(getFlag(bm1_filtered),5));|
This command returns indices of all reads that are mapped to the forward strand.
|rev_idx = find(bitget(getFlag(bm1_filtered),5));|
This command returns the indices of all reads that are mapped to the reverse strand.
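To see what the fifth bit picks out, here is a minimal sketch that applies the same two commands to the four flag values from your example. It uses a plain numeric vector in place of |getFlag(bm1_filtered)|, and the name |is_reverse| is only illustrative, so it runs without the BioMap object.

```matlab
% Flag values copied from the four example reads in the question,
% standing in for the output of getFlag(bm1_filtered).
flags = [163 83 99 147];

% bitget uses 1-based bit positions, so bit 5 is the value 16 (0x10).
is_reverse = bitget(flags, 5);

fow_idx = find(~is_reverse);   % reads mapped to the forward strand
rev_idx = find(is_reverse);    % reads mapped to the reverse strand

disp(flags(fow_idx))   % forward-strand flags: 163 and 99
disp(flags(rev_idx))   % reverse-strand flags: 83 and 147
```

This reproduces the grouping you observed: 163 and 99 on the forward strand, 83 and 147 on the reverse strand.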
In this paired-end ChIP-Seq data, the first reads in the pairs are not necessarily mapped to the forward strand only. Strand and position in the pair are recorded in separate bits of the flag, so the reads (and their mates) can be mapped to either the forward or the reverse strand. In our example, we have found the following pairs:
Reads with flag 99 are paired with reads with flag 147, and vice versa
The read with flag 99 is mapped to the forward strand, and its mate is mapped to the reverse strand. The read appears in the first input file (first in pair), which contains the 5’ reads. The read with flag 147 is mapped to the reverse strand, and its mate is mapped to the forward strand. The read appears in the second input file (second in pair), which contains the 3’ reads.
Reads with flag 83 are paired with reads with flag 163, and vice versa
The read with flag 83 is mapped to the reverse strand, and its mate is mapped to the forward strand. The read appears in the first input file (first in pair), which contains the 5’ reads. The read with flag 163 is mapped to the forward strand, and its mate is mapped to the reverse strand. The read appears in the second input file (second in pair), which contains the 3’ reads.
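The pairings above can be checked by decoding the relevant bits of each flag. The following is only a sketch in base MATLAB: the variable names are illustrative, and the bit meanings (5: read on reverse strand, 6: mate on reverse strand, 7: first in pair, 8: second in pair) follow the SAM flag definition with MATLAB's 1-based bit numbering.

```matlab
% The four flag values discussed above.
flags = [99 147 83 163];

is_reverse      = bitget(flags, 5);  % 0x10: this read is on the reverse strand
mate_is_reverse = bitget(flags, 6);  % 0x20: the mate is on the reverse strand
is_first        = bitget(flags, 7);  % 0x40: first in pair
is_second       = bitget(flags, 8);  % 0x80: second in pair

strand_names = {'forward', 'reverse'};

for k = 1:numel(flags)
    if is_first(k)
        pair_order = 'first';
    elseif is_second(k)
        pair_order = 'second';
    else
        pair_order = 'unknown';
    end
    % Adding 1 turns the 0/1 bit value into a valid cell index.
    fprintf('flag %3d: read on %s strand, mate on %s strand, %s in pair\n', ...
        flags(k), strand_names{is_reverse(k) + 1}, ...
        strand_names{mate_is_reverse(k) + 1}, pair_order);
end
```

So bits 7 and 8 tell you which reads are first or second in the pair (99 and 83 are first), while bit 5 tells you the strand, which is what |fow_idx| and |rev_idx| select on.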
Hope it helps,
Joe