[Biopython] Parsing Blast results in XML format

Jon Bråte jon.brate at bio.uio.no
Fri Nov 6 00:30:14 UTC 2009


Dear all,

I have a Blast output file in xml format generated by qBlast done  
through biopython. The Blast was performed with 22 query sequences and  
50 hits were returned for each query. The result is in one single xml  
file. I want to extract all the sequence IDs for all the hits (22x50)  
and I have been checking out the BioPython cookbook page 53.

I am using this code, but I am only getting the 50 hits for the 1st  
query sequence:
from Bio.Blast import NCBIXM
blast_records = NCBIXML.parse(result_handle)
blast_record = blast_records.next()

save_file = open("/Users/jonbra/Desktop/my_fasta_seq.fasta", 'w')

for alignment in blast_record.alignments:
     for hsp in alignment.hsps:
             save_file.write('>%s\n' % (alignment.title,))
save_file.close()
Can anyone help me to retrieve all the hits for all the query sequences?

Best wishes

Jon Bråte

-------------------------------------------------------------
Jon Bråte
PhD student

Microbial Evolution Research Group (MERG)
Department of Biology
University of Oslo
P.b. 1066 Blindern
N-0316 Oslo
Norway
Phone: +47 22855083





More information about the Biopython mailing list