[Bioperl-l] XML parser preference?
    Chris Fields 
    cjfields at uiuc.edu
       
    Thu Aug 10 13:35:21 UTC 2006
    
    
  
Jurgen,
Thanks for pointing that out!  However, the problem is we want to  
keep the number of dependencies down; there are already four XML  
parser dependencies for Bioperl (XML::Twig is one, but XML::LibXML  
isn't).
Maybe new modules which require XML parsing should stick with four XML
parsers, though not the current four (XML::DOM, XML::Twig,
XML::Parser, XML::SAX).
Maybe we should pick four XML parsers, each with their own particular  
strengths:
1)  XML::SAX  (SAX parsing; flexible, can use pure Perl, ExpatXS, etc)
      Switch modules using XML::Parser to XML::SAX (done for
      Bio::SearchIO::blastxml)
2)  XML::LibXML (DOM parsing; maintained, up to date, fast)
      Switch modules using XML::DOM to XML::LibXML
3)  XML::Twig (DOM-like, SAX-based) - great for processing 'chunks'  
of XML
      Used in Bio::DB::Taxonomy::entrez
4)  XML::Simple (small XML) - very easy to use XML parser
Since they are currently available for most (all?) OSes, this shouldn't
be a problem.  What do you think, Mauricio?
Chris
On Aug 10, 2006, at 7:29 AM, Jurgen Pletinckx wrote:
> | I have no doubt that XML::LibXML is a great parser (I've used it a few
> | times), the problem with it is that it runs on top of libxml2's C
> | library. On *nix systems it's fairly simple to have this dependency
> | compiled and running, but what about having it under other OS's (e.g.
> | Windows)?
> |
> | Introducing XML::LibXML as a dependency into the toolkit will probably
> | place EUtilities as a module not usable by everyone, especially those
> | who use BioPerl in a OS where installing/compiling C dependencies can be
> | a headache.
>
> Regarding XML::LibXML, there does appear to be an up-to-date ppm
> package (which fetches libxml2.dll) at
>
> http://theoryx5.uwinnipeg.ca/ppms/XML-LibXML.ppd
>
> (and less than a week since the release of the corresponding
> version to cpan, too.)
>
> So the threshold for distribution to Windows, at least, is less
> high than it might have been.
>
> -- 
> Jurgen Pletinckx
> AlgoNomics NV
Christopher Fields
Postdoctoral Researcher
Lab of Dr. Robert Switzer
Dept of Biochemistry
University of Illinois Urbana-Champaign
    
    