[Bioperl-l] extract feature seq when split between 2 GenBank accessions

Charles Hauser chauser at duke.edu
Wed Aug 27 12:11:29 EDT 2003


All,

I'd like to extract the CDS from genbank records and have found that in
some instances these are distributed among >1 genbank accession (see
below).

I have a script which does fine if CDS is fully contained within 1
accession, other than storing all accession seqs in a hash is there a
good way to deal with these?

Charles


LOCUS       AY095303S1              2375 bp    DNA     linear   PLN 21-JAN-2003
DEFINITION  Chlamydomonas reinhardtii c-type cytochrome synthesis 1 (CCS1)
            gene, ccs1-ac206 allele, 5'UTR and exons 1 through 6.
ACCESSION   AY095303
VERSION     AY095303.1  GI:25986619

     CDS             join(207..330,512..825,1045..1233,1418..1798,2000..2131,
                     2253..2345,AY095304.1:6..303,AY095304.1:495..677,
                     AY095304.1:863..1098)
                     /gene="CCS1"




LOCUS       AY095303S2              1505 bp    DNA     linear   PLN 21-JAN-2003
DEFINITION  Chlamydomonas reinhardtii c-type cytochrome synthesis 1 (CCS1)
            gene, ccs1-ac206 allele, exons 7, 8 and 9, 3'UTR and complete cds.
ACCESSION   AY095304
VERSION     AY095304.1  GI:25986620
     CDS             join(AY095303.1:207..330,AY095303.1:512..825,
                     AY095303.1:1045..1233,AY095303.1:1418..1798,
                     AY095303.1:2000..2131,AY095303.1:2253..2345,6..303,
                     495..677,863..1098)
                     /gene="CCS1"






More information about the Bioperl-l mailing list