[BioPython] GenBank parsing

Lee R. Shekter lee at epigenomix.com
Wed Oct 15 15:29:49 EDT 2003


Hi all -

I am trying to have BioPython become a standard at my new job (we're  a
tiny biotech start-up in Providence, RI). I am new to BioPython, not so
new to python. I have version 1.22 of BioPython installed on top of
python 2.3.2 and all is working correctly (as per tests, sample code,
etc.). I have read the documentation but am still somewhat puzzled. I
have to parse GenBank records for the following information (the
sequences have already been downloaded and so are local):

1) Accession number
2) protein sequence
3) Date of deposit
4) Country of origin

I'm sure it's just a slight modification of what is described in the
manual and there seems to be some snippets of code on the net that are
related to this, but if anyone could help me, i.e. point me to some code
which I can use to extract the information I need and put it into a
file, that would be greatly appreciated.

Lee Shekter



More information about the BioPython mailing list