[BioPython] plain txt blast output - xml instead

Michiel Jan Laurens de Hoon mdehoon at c2b2.columbia.edu
Thu Jun 15 22:37:18 UTC 2006


Peter wrote:
> According to the XML file, it is from BLASTP 2.2.14 [May-07-2006], maybe 
> they changed the XML format without telling anyone?
> 
It appears that the XML format did change.
With Blastp 2.2.14, a run with multiple searches generates a single XML 
file containing multiple <Iteration>...</Iteration> blocks, one for each 
search. With an older Blastp, multiple searches effectively generate 
multiple XML files (each with one <Iteration>...</Iteration> block), 
which are then concatenated into one output file. Biopython parses this 
output file by looking for the beginning of each XML file within it.
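
As a rough illustration of the splitting approach described above, here 
is a sketch in plain Python. It is not Biopython's actual code: the 
function name split_concatenated_xml, the "<?xml" marker and the two 
sample strings are assumptions made up for this example.

```python
def split_concatenated_xml(text, marker="<?xml"):
    """Split text holding several XML documents back to back.

    Hypothetical helper: every occurrence of the XML declaration is
    treated as the start of a new document.
    """
    starts = []
    pos = text.find(marker)
    while pos != -1:
        starts.append(pos)
        pos = text.find(marker, pos + len(marker))
    # Each document runs up to the start of the next one (or end of text).
    ends = starts[1:] + [len(text)]
    return [text[s:e].strip() for s, e in zip(starts, ends)]


# Hand-written stand-ins for the two output styles, not real BLAST output.
old_style = (
    '<?xml version="1.0"?>\n'
    '<BlastOutput><Iteration>first</Iteration></BlastOutput>\n'
    '<?xml version="1.0"?>\n'
    '<BlastOutput><Iteration>second</Iteration></BlastOutput>\n'
)
new_style = (
    '<?xml version="1.0"?>\n'
    '<BlastOutput>'
    '<Iteration>first</Iteration><Iteration>second</Iteration>'
    '</BlastOutput>\n'
)

old_chunks = split_concatenated_xml(old_style)  # one chunk per search
new_chunks = split_concatenated_xml(new_style)  # a single chunk
```

With the old-style input the split yields one chunk per search. With 
the new-style input it yields a single chunk holding both <Iteration> 
blocks, so any code that assumes one search per chunk would miss the 
second search.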

The new output is in a sense better because the output file is itself a 
valid XML file, which a concatenation of several XML files is not. It 
may be that Biopython's XML parser ignores the <Iteration> tags, since 
in the old format there was only one <Iteration> block per XML file 
anyway, and therefore fails with the new format.
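
For comparison, a minimal sketch of reading the new format by walking 
the <Iteration> blocks explicitly, using Python's standard 
xml.etree.ElementTree module. This is not Biopython's parser. The sample 
document is hand-written and heavily trimmed, and the helper name 
iteration_numbers is made up for this example.

```python
import xml.etree.ElementTree as ET

# Hand-written, trimmed stand-in for new-style output (not real BLAST data).
sample = """<?xml version="1.0"?>
<BlastOutput>
  <BlastOutput_iterations>
    <Iteration><Iteration_iter-num>1</Iteration_iter-num></Iteration>
    <Iteration><Iteration_iter-num>2</Iteration_iter-num></Iteration>
  </BlastOutput_iterations>
</BlastOutput>
"""


def iteration_numbers(xml_text):
    """Return the iteration number of every <Iteration> block found."""
    root = ET.fromstring(xml_text)
    # root.iter() visits matching elements at any depth, so each
    # <Iteration> block is handled as a separate search result.
    return [
        int(iteration.findtext("Iteration_iter-num"))
        for iteration in root.iter("Iteration")
    ]


numbers = iteration_numbers(sample)
```

Treating each <Iteration> element as one search result, rather than 
each XML file, is the change the new format seems to call for.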

--Michiel.

-- 
Michiel de Hoon
Center for Computational Biology and Bioinformatics
Columbia University
1130 St Nicholas Avenue
New York, NY 10032


