[Biopython-dev] blastpgp parsing buglet

Jeffrey Chang jchang at jeffchang.com
Fri May 9 02:02:48 EDT 2003


Great patch!  I've committed it.

Thanks,
Jeff


On Thursday, May 8, 2003, at 11:45  AM, Coleman, Michael wrote:

> Parsing by NCBIStandalone.py fails for BLASTP 2.2.5 output.  This is 
> the partial output that trips the problem:
>
> gi|23099742|ref|NP_693208.1| ornithine aminotransferase [Oceanob...   
> 430   e-119
> gi|16081241|ref|NP_393547.1| L-2, 4-diaminobutyrate:2-ketoglutar...   
> 430   e-119
>
> Sequences not found previously or not previously below threshold:
>
>> gi|23466947|gb|ZP_00122533.1| hypothetical protein [Haemophilus 
>> somnus 129PT]
>           Length = 432
>
>  Score =  591 bits (1524), Expect = e-167
>  Identities = 191/420 (45%), Positives = 291/420 (69%), Gaps = 7/420 
> (1%)
>
> The code expects to see a 'CONVERGED' but none is given here.  One 
> possible fix would be to also look for a line beginning with '>', like 
> so
>
>             # Read the descriptions and the following blank lines.
>             read_and_call_while(uhandle, consumer.noevent, blank=1)
>             l = safe_peekline(uhandle)
>             if l[:9] != 'CONVERGED' and l[:1] != '>':
>                 read_and_call_until(uhandle, consumer.description, 
> blank=1)
>                 read_and_call_while(uhandle, consumer.noevent, blank=1)
>
> Mike
>
> Mike Coleman, Scientific Programmer, +1 816 926 4419
> Stowers Institute for Biomedical Research
> 1000 E. 50th St., Kansas City, MO  64110
>
> _______________________________________________
> Biopython-dev mailing list
> Biopython-dev at biopython.org
> http://biopython.org/mailman/listinfo/biopython-dev




More information about the Biopython-dev mailing list