[Biopython] Parsing problem

Peter biopython at maubp.freeserve.co.uk
Tue Dec 8 23:53:29 UTC 2009


On Tue, Dec 8, 2009 at 10:43 PM, Peter <biopython at maubp.freeserve.co.uk> wrote:
> On Tue, Dec 8, 2009 at 6:52 PM, Iwan Grin <iwan.grin at googlemail.com> wrote:
>> Hi all,
>>
>> I am having a little problem while trying to parse a GenBank (or rather
>> GenProt) file using BioPython. I am trying to extract the position on the
>> genome from the "coded_by" qualifier of the CDS feature of a protein.
>>
>> The "coded_by" string in this specific case looks like this:
>>
>> 'complement(NC_012967.1:
>> 3622110..3624728)'
>
> Oh, one of those tricky cross references to another file :(

It looks like the Bio.GFF.easy code expects that to be formatted
as NC_012967.1:complement(3622110..3624728) and not as
complement(NC_012967.1:3622110..3624728)

Peter



More information about the Biopython mailing list