[Bioperl-l] tigrxml parsing
Robert Buels
rmb32 at cornell.edu
Thu Aug 6 17:14:12 UTC 2009
Hi all,
Recently in #bioperl somebody came by trying to use
Bio::SeqIO::tigrxml.pm to parse the medicago genome annotations at
http://www.medicago.org/genome/downloads/Mt2/MT2.0_medicago_chrX_20080103_NoOverlap.xml.tar.gz
svn HEAD tigrxml.pm was not at all happy with these files, eventually
dieing with
------------- EXCEPTION: Bio::Root::Exception -------------
MSG: start is undefined
STACK: Error::throw
STACK: Bio::Root::Root::throw Bio/Root/Root.pm:368
STACK: Bio::RangeI::contains Bio/RangeI.pm:255
STACK: Bio::SeqFeature::Generic::add_SeqFeature
Bio/SeqFeature/Generic.pm:783
STACK: Bio::SeqIO::tigrxml::start_element Bio/SeqIO/tigrxml.pm:206
STACK: try{} block /usr/share/perl5/XML/SAX/Base.pm:292
STACK: XML::SAX::Base::start_element /usr/share/perl5/XML/SAX/Base.pm:266
STACK: XML::SAX::Expat::_handle_start /usr/share/perl5/XML/SAX/Expat.pm:225
STACK: XML::Parser::Expat::parse /usr/lib/perl5/XML/Parser/Expat.pm:469
STACK: XML::Parser::parse /usr/lib/perl5/XML/Parser.pm:187
STACK: XML::SAX::Expat::_parse_bytestream
/usr/share/perl5/XML/SAX/Expat.pm:45
STACK: XML::SAX::Base::parse /usr/share/perl5/XML/SAX/Base.pm:2602
STACK: XML::SAX::Base::parse_file /usr/share/perl5/XML/SAX/Base.pm:2631
STACK: Bio::SeqIO::tigrxml::next_seq Bio/SeqIO/tigrxml.pm:116
STACK: /crypt/rob/test2.pl:10
-----------------------------------------------------------
Looking at the medicago XML and comparing it to the
bioperl-live/t/data/test.tigrxml, the two look VERY different in
structure. Lots of things that are attrs in test.tigrxml seem to be
elements in the medicago XML, for example.
So I guess the question is: is the medicago TIGR XML malformed? Can
tigrxml.pm be expected to parse it? What, if anything, should be done
about this?
Rob
--
Robert Buels
Bioinformatics Analyst, Sol Genomics Network
Boyce Thompson Institute for Plant Research
Tower Rd
Ithaca, NY 14853
Tel: 503-889-8539
rmb32 at cornell.edu
http://www.sgn.cornell.edu
More information about the Bioperl-l
mailing list