[Biopython] New to BP. Looking for closely spaced genes

Mark Budde markbudde at gmail.com
Mon Apr 1 18:41:43 UTC 2013


Hi,
Before I dive too far into Biopython, I'd like some input on whether
Biopython is an appropriate tool for my task.

I would like to look at the human genome ORF structure and identify regions
where ORFs are closely spaced but differentially regulated, and also
identify whether the ORFs are facing the same direction or opposing
directions. To do this, I assume I would first download the annotated
genome and write a script in Biopython recording how far each ORF is from
its neighbors and what the orientation is, storing the result in a
dictionary. Then I would download some expression data sets and add this
data to the dictionary. Finally, I would write an algorithm comparing
gene distance, orientation and expression correlation to generate a list of
candidate ORF pairs which fit my criteria.
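
To make the distance and orientation step concrete, here is a rough sketch
of what I have in mind. It is plain Python rather than Biopython, the
coordinates are made-up toy values, and the names Gene and neighbor_pairs
are hypothetical, not part of any library. It also assumes the genes do not
nest inside each other, which real annotation will not always satisfy.

```python
from collections import namedtuple

# Hypothetical record type; in practice these fields would be filled in
# from a parsed annotation file. strand is +1 or -1.
Gene = namedtuple("Gene", ["name", "chrom", "start", "end", "strand"])


def neighbor_pairs(genes, max_gap):
    """Return adjacent same-chromosome gene pairs whose gap is <= max_gap."""
    pairs = []
    # Sort by chromosome, then start, so neighbors end up next to each other.
    ordered = sorted(genes, key=lambda g: (g.chrom, g.start))
    for left, right in zip(ordered, ordered[1:]):
        if left.chrom != right.chrom:
            continue
        # Overlapping genes are treated as having a gap of zero.
        gap = max(0, right.start - left.end)
        if gap > max_gap:
            continue
        if left.strand == right.strand:
            orientation = "same"
        elif left.strand == -1 and right.strand == 1:
            orientation = "divergent"   # transcribed away from each other
        else:
            orientation = "convergent"  # transcribed toward each other
        pairs.append((left.name, right.name, gap, orientation))
    return pairs


# Toy data, invented purely for illustration.
genes = [
    Gene("A", "chr1", 100, 500, 1),
    Gene("B", "chr1", 700, 1200, -1),
    Gene("C", "chr1", 5000, 6000, 1),
    Gene("D", "chr2", 100, 300, -1),
    Gene("E", "chr2", 350, 900, 1),
]
close_pairs = neighbor_pairs(genes, max_gap=1000)
```

Expression correlation for each pair in close_pairs would then be looked up
separately and used to filter the list.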

My question is: is Biopython a reasonable tool to accomplish this, or is it
going to be way too slow, in which case some alternative package would be
better suited for my task?
Thanks,
Mark Budde


