[BioPython] Problem parsing Blast XML output from different sources
Michiel Jan Laurens de Hoon
mdehoon at c2b2.columbia.edu
Thu Oct 5 16:01:04 UTC 2006
Which sequence are you running blast on?
I'd like to try this on our local blast installation.
--Michiel.
Steffi Gebauer-Jung wrote:
> Hello,
>
> because of blastall 2.2.14 output was not parsed from the
> Bio.Blast.NCBIStandalone parser,
> I tried to switch to the recommended Bio.Blast.NCBIXML parser.
>
> Thereby I found, that the xml output of the locally installed standalone
> blastall (2.2.14)
> differs from the web xml output.
>
> For BlastN hsps on Plus/Minus strands, the xml gives
> query_frame/hit_frame 1 / -1 as usual.
> But query and frame positions and sequences are switched in direction
> (would match frames -1/1).
>
> As the Bio.Blast.Record returned by the NCBIXML parser only gives
> frames, sequences
> and start positions it is not possible (without knowing the source of
> the xml file)
> to be sure to find the right data.
>
> This is clearly a problem of Blast.
> But because of the missing end positions in the returned record object
> it becomes a problem for users of the parser too.
>
> Could somebody try to confirm the different behaviour of the xml blast
> output
> with his/her own examples/installation?
>
> Thanks, Steffi
>
>
>
> _______________________________________________
> BioPython mailing list - BioPython at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/biopython
--
Michiel de Hoon
Center for Computational Biology and Bioinformatics
Columbia University
1130 St Nicholas Avenue
New York, NY 10032
More information about the Biopython
mailing list