[Biopython-dev] support for database of BOLDSYSTEMS?

Carlos Peña mycalesis at gmail.com
Tue Nov 4 20:31:17 UTC 2014

Hi all,

I have written an interface to the BOLD database of DNA barcodes. It
accepts FASTA files, sends them to BOLD and gets the specimen
identifications to the species level:


I was wondering whether it could be included into BioPython? So far the
packages is a bunch of scripts and I want to make it more robust.
The working version is not so efficient as the running time has exponential
growth (n squared).

However, I was able to use asynchronous calls (using Twisted) to make it
faster. The script was able to take (n) seconds for (n) number of
sequences. But I don't fully understand Twisted and the package is unstable.

So, I wanted to ask if this little project of mine has any hope of getting
into BioPython. If that is the case I would need some pointers on using
proper classes for the code and fixing the code so that it can be
integrated. I guess I would need to drop Twisted and use instead a standard
Python library for multithreading.

I want to improve the package anyways, make it more robust and quick. So I
wanted to ask before giving another chance to Twisted.

Any comments would be appreciated,


Dr. Carlos Peña
Laboratory of Genetics
Department of Biology
University of Turku
20014 Turku
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.open-bio.org/pipermail/biopython-dev/attachments/20141104/fc574105/attachment.html>

More information about the Biopython-dev mailing list