[Biopython-dev] New: Uniprot XML parser
Peter
biopython at maubp.freeserve.co.uk
Mon Sep 13 17:47:23 UTC 2010
Hi Andrea,
I've done some work on the plain text swiss parser to handle features,
and some basic testing to make sure it agrees with the uniprot-xml
parser. This showed some end locations that were off by one
in the XML parser, which I believe I was able to resolve. I have also
commented out the use of the skip_parsing_errors option - it doesn't
seem to be needed and silent errors are bad.
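
For anyone following along, here is a minimal sketch of the kind of
conversion where an end location can slip by one. It assumes (this is
not stated above) that the file gives 1-based inclusive begin/end
values and that we want a 0-based half-open Python span. The function
name is hypothetical and is not part of either parser.

```python
def uniprot_to_python_span(begin, end):
    """Convert a 1-based inclusive (begin, end) pair to a
    0-based half-open (start, stop) pair for Python slicing.

    Hypothetical helper for illustration only.
    """
    if begin < 1 or end < begin:
        raise ValueError("expected 1 <= begin <= end, got %r, %r" % (begin, end))
    # Only the start shifts down by one; the inclusive end already
    # equals the exclusive stop. Subtracting one from both is the
    # classic way to end up with an end location out by one.
    return begin - 1, end


seq = "MKTAYIAKQR"
start, stop = uniprot_to_python_span(3, 5)
# Residues 3..5 (1-based, inclusive) are T, A, Y.
assert seq[start:stop] == "TAY"
```

Comparing the spans produced by both parsers for the same record is
then a simple equality check on (start, stop) pairs.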
I have (for the moment) introduced a couple of new position classes
in Bio.SeqFeature for "?123" where we have a position but it is
uncertain, and "?" where we don't have a position at all. The latter
might be handled more elegantly by inferring a Before/AfterPosition
instead...
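
To make the three cases concrete, here is a rough, self-contained
sketch of how the position strings could be told apart. The class and
function names are hypothetical stand-ins for illustration, not the
actual classes in Bio.SeqFeature, and the sketch assumes the raw value
is a string such as "123", "?123" or "?".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExactPos:
    """A known position, e.g. "123" (hypothetical class)."""
    value: int


@dataclass(frozen=True)
class UncertainPos:
    """A position that is given but uncertain, e.g. "?123" (hypothetical class)."""
    value: int


@dataclass(frozen=True)
class UnknownPos:
    """No position at all, i.e. "?" (hypothetical class)."""


def parse_position(text):
    """Map a raw position string onto one of the three sketch classes."""
    text = text.strip()
    if text == "?":
        return UnknownPos()
    if text.startswith("?"):
        # Leading "?" followed by digits: a value exists but is uncertain.
        return UncertainPos(int(text[1:]))
    return ExactPos(int(text))


assert parse_position("123") == ExactPos(123)
assert parse_position("?123") == UncertainPos(123)
assert parse_position("?") == UnknownPos()
```

Keeping "?" as its own type, rather than reusing None, lets calling
code distinguish "no position given" from "feature not parsed".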
Note that for testing purposes, I have disabled your code where
it builds a SeqFeature for a dbReference - I'm not sure what the
best plan here is yet.
Could you have a look at my branch please?
http://github.com/peterjc/biopython/commits/uniprot
Thanks,
Peter