ORF (open reading frame) finder for genome

조회 수: 12 (최근 30일)
Sara Kamali
Sara Kamali 2011년 7월 1일
편집: Achal Mahajan 2022년 6월 1일
Dear Sir,
I need to generate a set of all possible ORFs (4 codons or longer)
in the entire genome for some microbial sequences, for that I used
“SEQSHOWORFS “ Function from Bioinformatics toolbox and got the
following error :
“SEQSHOWORFS can only display at most 1000000 symbols. Use
“SEQTOOL” to display ORFs for long sequences or use a shorter sub-
sequence or fewer reading frames”.
Since “SEQtool” is only ORF visualization tool, How can we take
out the identified ORFs into a separate file for the further
analysis?
Thank you so much in advance sadaf
  댓글 수: 1
N/A
N/A 2011년 11월 19일
Hi Sara!
Did you discover how to obtain ORFs for long sequences into a separate file?
I have the same problem here!
=/
Thanks!

댓글을 달려면 로그인하십시오.

답변 (2개)

Stephanie Logan
Stephanie Logan 2018년 10월 9일
if you go into edit seqshoworfs.m it opens the file. Read through it and find where it restricts the limit. It says it causes a Java issue but I added two zeros and had no problem. Took some time but worked.

Achal Mahajan
Achal Mahajan 2022년 6월 1일
편집: Achal Mahajan 2022년 6월 1일
Hi,
You can use seqviewer app to visualize the base pairs if the size of the genome > 1000000. The app also generates a set of all possible ORFs, complement sequence and reverse complement sequence as well.
A random sequence is generated here using the in-built function randseq.
Here is a sample script to do that:
-Achal
clear all;clc;close all;
s = randseq(1000000, 'alphabet', 'dna');
seqviewer(s)

카테고리

Help CenterFile Exchange에서 Genomics and Next Generation Sequencing에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by