[Bioperl-l] parsing blast HIT with multiple GIs

suhoiy suhoiy@21cn.com
Thu, 31 Oct 2002 21:58:23 GMT


I am sorry I did not express myself clearly. For example, here is part of a hit:

==========

>gi|9789726|sp|O55131|SEP7_MOUSE   Septin 7 (CDC10 protein homolog)
 gi|2864606|emb|CAA11547.1|   (AJ223782) CDC10 [Mus musculus]
          Length = 436

 Score =  225 bits (574), Expect = 9e-58
 Identities = 119/284 (41%), Positives = 184/284 (63%)
 Frame = -1

==========

$hit->description returned "Septin 7 (CDC10 protein homolog)", 
while I want " gi|2864606|emb|CAA11547.1|...".

I traced the source code,  those lines are discarded in next_result. :(


---Original Message---
From: Jason Stajich<jason@cgt.mc.duke.edu>
Subject: Re: [Bioperl-l] parsing blast HIT with multiple GIs
 
>they are in the $hit->description part.
>
>-jason
>On Thu, 31 Oct 2002, suhoiy wrote:
>
>> Hello all,
>>
>> When I parse some blast results of proteins, I found some hits had
>> multiple GIs, which are identical sequences. The SearchIO module
>> discard the GIs without a leading >. Then how can I get all the GIs
>> in a blast result? any suggestion?
>>
>> Many thanks! :)
>>
>> suhoiy
>>
>>
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l@bioperl.org
>> http://bioperl.org/mailman/listinfo/bioperl-l
>>
>
>-- 
>Jason Stajich
>Duke University
>jason at cgt.mc.duke.edu
>

 
---End of Message---