[Bioperl-l] GenBank Parser

Drew Stewart d25_bioperl@yahoo.com
Mon, 30 Dec 2002 18:02:01 -0800 (PST)


Hi Everybody,
I have just joined the mailing list and kind of new to
Bioperl.
Here is my problem,
I am trying to write a parser for the GenBank file
obtained from the NCBI website, specifically for the
"CDS" feature in the GenBank file.
I am trying to parse the range given in the "CDS"
feature to get the nucleotide subsequence from the
whole genome for a specific protein.
For example here is a portion of the genbank file I am
interested in parsing.

  CDS   complement(join(12..78,54..1043))

or

CDS   join(complement(<1..799),complement(5080..5120))

I want to parse  this and other possible formats
"join(complement(<1..799),complement(5080..5120))" 

I have seen in the GenBank readme file that there are
many other possible formats for this CDS feature line
and so I was wondering if somebody has already written
a parser for this.

Can anyone please suggest some things I can use.
It would be a great help. Thank you.

Sincerely,
Dhruv Bhatt.

__________________________________________________
Do you Yahoo!?
Yahoo! Mail Plus - Powerful. Affordable. Sign up now.
http://mailplus.yahoo.com