[Biopython-dev] Blast
    Jeffrey Chang
    jeffrey.chang at duke.edu
    Thu Sep 29 22:16:04 EDT 2005

On Sep 29, 2005, at 1:46 PM, Michiel De Hoon wrote:
> To my surprise, the parser in Blast.NCBIWWW tries to parse HTML output
> instead of text output. My guess is that the HTML output changes more
> often and is more difficult to parse than text output. So isn't it
> possible to make NCBIWWW.qblast return text output instead of HTML and
> parse that instead? So my question is, why was the choice made to parse
> HTML instead of text? Is it simply because blast-on-the-web couldn't
> return text output in the past?
You are right.  It was done that way in the past, when the only way to
use NCBI's BLAST was through the HTML output.  (Actually, there was a
version that you could access through a proprietary non-HTTP
protocol, but the databases were not updated as frequently.)  Now
that we can get text, perhaps it is time to encourage users to use
the text output instead.  I believe the HTML parser is a few versions
behind now and can no longer parse current BLAST output.
Jeff
    
    