[Biopython-dev] Fwd: Blast XML Parse Improvements

Travis Wrightsman twrig002 at ucr.edu
Thu Jul 30 15:51:58 UTC 2015


Forgot to reply all, see below. Issue Resolved!

Travis Wrightsman

Begin forwarded message:

> From: Travis Wrightsman <twrig002 at ucr.edu>
> Date: July 29, 2015 at 12:32:08 PM PDT
> To: Peter Cock <p.j.a.cock at googlemail.com>
> Subject: Re: [Biopython-dev] Blast XML Parse Improvements
> 
> That is an unfortunately labeled feature, hopefully NCBI will implement a subject-like feature that works the way we expect it to.
> 
> Thank you for the clarification!
> 
> Travis Wrightsman
> 
>> On Jul 29, 2015, at 11:26 AM, Peter Cock <p.j.a.cock at googlemail.com> wrote:
>> 
>> Hi Travis,
>> 
>> This is an NCBI "feature", not a Biopython bug.
>> 
>> Using -subject means BLAST does lots of pairwise comparisons,
>> so you get lots of iteration blocks in the XML. To get the results
>> you probably want (and meaningful e-values), first make a BLAST
>> database. See:
>> 
>> http://blastedbio.blogspot.co.uk/2012/05/blast-ingoring-search-space-size-for-e.html
>> 
>> Regards,
>> 
>> Peter
>> 
>>> On Wed, Jul 29, 2015 at 5:19 PM, Travis Wrightsman <twrig002 at ucr.edu> wrote:
>>> Peter and others,
>>> 
>>> Here is the gist: https://gist.github.com/twrightsman/a94c66016692fd4295be
>>> 
>>> The xml output shows the tool version and I included my shell commands used
>>> to get the results; it is still using the "current" BLAST XML format with
>>> iteration blocks.
>>> 
>>> -Travis
>>> 
>>> On Wed, Jul 29, 2015 at 1:44 AM, Peter Cock <p.j.a.cock at googlemail.com>
>>> wrote:
>>>> 
>>>> Right, the new "blastxml2" output in BLAST+ 2.2.31 is a whole new format,
>>>> but the existing "blastxml" which Biopython already has a parser for is
>>>> still available as usual via -outfmt 5.
>>>> 
>>>> And that parser ought to work fine with multiple queries.
>>>> 
>>>> Travis, can you post more details? e.g. BLAST+ command line used
>>>> and ideally sample output (on gist.github.com or similar)
>>>> 
>>>> Thanks,
>>>> 
>>>> Peter
>>>> 
>>>> On Wed, Jul 29, 2015 at 6:29 AM, Fields, Christopher J
>>>> <cjfields at illinois.edu> wrote:
>>>>> Keep in mind there are significant changes for BLAST+ XML output in the
>>>>> latest release, not all of them good IMHO.  Peter has a pretty good
>>>>> writeup
>>>>> on this:
>>>>> 
>>>>> http://blastedbio.blogspot.com/2015/07/blast-xml-2-include-trouble.html
>>>>> 
>>>>> chris
>>>>> 
>>>>> On Jul 29, 2015, at 12:21 AM, Wibowo Arindrarto <w.arindrarto at gmail.com>
>>>>> wrote:
>>>>> 
>>>>> Hi Travis,
>>>>> 
>>>>> There hasn't been any new test cases added for BLAST 2.2.30, as I
>>>>> recall. Did you get the same behavior when using Bio.SearchIO to parse
>>>>> it?
>>>>> 
>>>>> Best regards,
>>>>> Bow
>>>>> 
>>>>> On Wed, Jul 29, 2015 at 7:12 AM, Travis Wrightsman <twrig002 at ucr.edu>
>>>>> wrote:
>>>>> 
>>>>> Biopython Devs,
>>>>> 
>>>>> I was trying to parse through a tblastn XML output with multiple queries
>>>>> in
>>>>> the query file and it was generating a BLAST record object for each
>>>>> contig
>>>>> in the subject file instead of each run. Is this the intended behavior
>>>>> or
>>>>> has this not been updated for ncbi tools 2.2.30?
>>>>> 
>>>>> If it needs to be updated or finished, I'm more than happy to rewrite or
>>>>> update some of the code in NCBIXML for the latest version.
>>>>> 
>>>>> -Travis
>>>>> 
>>>>> _______________________________________________
>>>>> Biopython-dev mailing list
>>>>> Biopython-dev at mailman.open-bio.org
>>>>> http://mailman.open-bio.org/mailman/listinfo/biopython-dev
>>>>> 
>>>>> _______________________________________________
>>>>> Biopython-dev mailing list
>>>>> Biopython-dev at mailman.open-bio.org
>>>>> http://mailman.open-bio.org/mailman/listinfo/biopython-dev
>>>>> 
>>>>> 
>>>>> 
>>>>> _______________________________________________
>>>>> Biopython-dev mailing list
>>>>> Biopython-dev at mailman.open-bio.org
>>>>> http://mailman.open-bio.org/mailman/listinfo/biopython-dev
>>>> _______________________________________________
>>>> Biopython-dev mailing list
>>>> Biopython-dev at mailman.open-bio.org
>>>> http://mailman.open-bio.org/mailman/listinfo/biopython-dev
>>> 
>>> 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.open-bio.org/pipermail/biopython-dev/attachments/20150730/dee55a6c/attachment.html>


More information about the Biopython-dev mailing list