Exploring Genome-wide Differences in DNA Methylation Profiles documentation
이전 댓글 표시
Could someone please explain to me how am I supposed to do the following steps?
" *(1) downloaded the files SRR030222.sra, SRR030223.sra, SRR030224.sra and SRR030225.sra containing the unmapped short reads for two replicates of from the DICERex5 sample and two replicates from the HCT116 sample respectively. Converted them to FASTQ-formatted files using the NCBI SRA Toolkit.
(2) produced SAM-formatted files by mapping the short reads to the reference human genome (NCBI Build 37.5) using the Bowtie [2] algorithm. Only uniquely mapped reads are reported.
(3) compressed the SAM formatted files to BAM and ordered them by reference name first, then by genomic position by using SAMtools [3].* "
For part 1 I downloaded to files using the FTP site and I used to file fastq-dump.exe to find the fastq format. After that I have no idea what to do. Help is appreciated. Wish Matlab documentation was more descriptive though.
채택된 답변
추가 답변 (1개)
elcebir
2016년 2월 1일
0 개 추천
For bowtie 2 the given answer need to be corrected for the flags and usage. in the 3 step the usage and flags should be in the given way 3-$BT2_HOME/bowtie2 -p 16 --local -M 3 -x hsGRCh38 -U $BT2_HOME/example/reads/SRR030224.fastq -S SRR030224.sam --best and -v options are not valid in bowtie2 In the last step the usage can be in the given way 5-samtools sort -l 9 -n -T abc SRR030224unordered.bam -o SRR030224.bam
카테고리
도움말 센터 및 File Exchange에서 Genomics and Next Generation Sequencing에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!