[Biopython-dev] Notification: incoming/35

biopython-bugs at bioperl.org biopython-bugs at bioperl.org
Tue Jun 19 10:57:43 EDT 2001


JitterBug notification

new message incoming/35

Message summary for PR#35
	From: tarjei at mit.edu
	Subject: NCBIStandalone.BlastParser bug
	Date: Tue, 19 Jun 2001 10:57:42 -0400
	0 replies 	0 followups

====> ORIGINAL MESSAGE FOLLOWS <====

>From tarjei at mit.edu Tue Jun 19 10:57:42 2001
Received: from localhost (localhost [127.0.0.1])
	by pw600a.bioperl.org (8.11.2/8.11.2) with ESMTP id f5JEvg826272
	for <biopython-bugs at pw600a.bioperl.org>; Tue, 19 Jun 2001 10:57:42 -0400
Date: Tue, 19 Jun 2001 10:57:42 -0400
Message-Id: <200106191457.f5JEvg826272 at pw600a.bioperl.org>
From: tarjei at mit.edu
To: biopython-bugs at bioperl.org
Subject: NCBIStandalone.BlastParser bug

Full_Name: Tarjei Mikkelsen
Module: Bio.Blast.NCBIStandalone.BlastParser
Version: 1.00a
OS: Dec/Alpha OSF1
Submission from: incognito.mit.edu (18.246.0.239)


The standalone BLAST record parser (Bio.Blast.NCBISTandalone.BlastParser) fails
with a SyntaxError when the (path)name of the database spans more than one
line.

The following code stub/BLAST output will reproduce the bug: (Even though this
example is from BLAST 2.0.5 the same thing happens in newer versions)

<<<<<CUT: blast_parser_bug.py>>>>>
from Bio.Blast import NCBIStandalone

blast_out = open("blast_parser_bug.out", "r")
blast_parser = NCBIStandalone.BlastParser()
blast_record = blast_parser.parse(blast_out)
<<<<<CUT>>>>>

<<<<<CUT: blast_parser_bug.out>>>>>
BLASTP 2.0.5 [May-5-1998]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= eco:b1416
         (83 letters)

Database: /home/strontium/tarjei/pathway/src/Bio/Pathway/data/2.7.1.11
.fa
           39 sequences; 18,779 total letters

Searching......................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits) 
Value

spy:SPy1283                                                           20  0.64
lla:L0002                                                             20  0.84

>spy:SPy1283
           Length = 337
           
 Score = 20.4 bits (41), Expect = 0.64
 Identities = 10/26 (38%), Positives = 17/26 (64%), Gaps = 1/26 (3%)

Query: 21  GYTDEEIVSSDIIG-SHFGSVFDATQ 45
           G  +EE+V S I+G +  G++F  T+
Sbjct: 287 GIHNEELVESPILGTAEEGALFSLTE 312


>lla:L0002
           Length = 340
           
 Score = 20.0 bits (40), Expect = 0.84
 Identities = 10/25 (40%), Positives = 16/25 (64%), Gaps = 1/25 (4%)

Query: 21  GYTDEEIVSSDIIG-SHFGSVFDAT 44
           G  +EE+V S I+G +  G++F  T
Sbjct: 286 GIRNEELVESPILGTAEEGALFSLT 310


 Score = 18.8 bits (37), Expect = 1.9
 Identities = 9/29 (31%), Positives = 17/29 (58%), Gaps = 1/29 (3%)

Query: 28  VSSDIIGSHFGSVFD-ATQTEITAVGDLQ 55
           + +DI+G+ F   FD A  T + A+  ++
Sbjct: 126 IDNDIVGTDFTIGFDTAVSTVVDALDKIR 154


  Database: /home/strontium/tarjei/pathway/src/Bio/Pathway/data/2.7.1.
  11.fa
    Posted date:  Jun 18, 2001  1:19 PM
  Number of letters in database: 18,779
  Number of sequences in database:  39
  
Lambda     K      H
   0.313    0.129    0.352 

Gapped
Lambda     K      H
   0.270   0.0470    0.230 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2788
Number of Sequences: 39
Number of extensions: 119
Number of successful extensions: 3
Number of sequences better than 10: 2
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3
length of query: 83
length of database: 18779
effective HSP length: 33
effective length of query: 50
effective length of database: 17492
effective search space:   874600
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 34 (18.3 bits)
S2: 31 (16.5 bits)
<<<<<CUT>>>>>





More information about the Biopython-dev mailing list