[BioPython] retrieve sequence coordinates of exons for a stretch of genes

Peter Cock p.j.a.cock at googlemail.com
Thu Feb 14 11:01:14 UTC 2008


Hi Kevin,

Where do you normally get your genomes from? I am most familiar with
the NCBI formats, so I would start by examining the GenBank file for
the relevant genome.  Have a look by hand first - it may well have
features for these genes, and in particular a CDS feature which marks
out the introns/exons for you.

Biopython will read GenBank files, although I would say dealing with
the locations via the SeqFeature object is a little fiddly... have a
look at the main documentation and also perhaps
http://www2.warwick.ac.uk/go/peter_cock/python/genbank/

Peter

On Thu, Feb 14, 2008 at 6:02 AM, Kevin Lam <ULNJUJERYDIX at spammotel.com> wrote:
> Hi
>  I have been scouring through the web for something I thought was a rather
>  simple task but I can't find the answer.
>
>  How do I get the sequence coordinates for exons of genes in a stretch of
>  genome demarcated by say HoxA13 and Hox A1 ?
>
>  below is the example of the data I am looking for.
>
>  1026087..1026688    1026807..1026834    1026839..1027045    HOXD12
>  1033641..1034421    1035192..1035427    1035428..1035873    HOXD11
>  _______________________________________________
>  BioPython mailing list  -  BioPython at lists.open-bio.org
>  http://lists.open-bio.org/mailman/listinfo/biopython
>



More information about the Biopython mailing list