[Biopython-dev] Conserved Domains Database Support
Adam Hughes
hughesadam87 at gmail.com
Tue Jul 3 19:19:04 UTC 2012
Hi everyone,
I'm new to the BioPython library and was wondering if there was any support
for the conserved domains database from NCBI? In particular, the
superfamily batch files that their webtool releases. Doing a Google
search, there was some interest for this back in 2008; however, they were
mainly interested in parsing the HTML output of CDD searches. Now that CDD
offers a nice, regular downloadable datatype, has any BioPython support
been implemented to work with this?
If not, I'd like to contribute.
The data is simple tab-delmited formats of domain alignments, E.G.:
Q#10000 0 >WHL22.364604.0 superfamily 212291 7 290
1.01528e-138 401.1 cl09099 P-loop_NTPase superfamily
0
I had envisioned a simple class of mainly getters/setters with a few
methods such as sorting by Query batches.
~Adam
More information about the Biopython-dev
mailing list