[Biojava-l] Extracting 3' UTR, 5' UTR, exons, introns, CD sequence structure for NCBI NM RefSeq

dmitriy dms700 at gmail.com
Wed Jul 25 16:10:11 EDT 2007


Hi

Does anyone has the code which takes NCBI NM RefSeq number and NCBI NC
RefSeq number gets NC RefSeq  from NCBI and parses it in such way so
for specified NM RefSeq "gene table" object is build. "Gene table"
object should have information on 3' UTR, 5' UTR, exons, introns and
CD sequence. The data in "gene table" should be sufficient for example
to generate sequence string with 3' UTR, 5' UTR,  introns, non coding
exon(s)  or part(s) of exon(s)  in small letters and coding exon(s)
or part(s) of exon(s)  in capital letters.

Thanks
Dmitriy


More information about the Biojava-l mailing list