[Bioperl-l] organism restricted blast search

Aaron J. Mackey amackey at pcbi.upenn.edu
Thu Aug 5 12:20:31 EDT 2004


On Aug 5, 2004, at 12:03 PM, Jason Stajich wrote:

> I believe there are ways of building virtual subsets of the databases
> locally with some tricks with formatdb but I don't really know them.

You can make a file that has the GI numbers of the sequences you want 
to search against (say, a GI number dump from the Taxonomy DB), and 
then use "-l gilist" with NCBI blast tools

> You can also parse your reports and lookup the GI number using the 
> tools
> in Bio::Taxonomy and Bio::DB::Taxonomy to get the species name and or
> restrict hits to a particular subset of the taxonomy hierarchy.

This also works, but your expectation values will be artificially 
conservative (as you actually searched a potentially much larger 
database).

-Aaron



More information about the Bioperl-l mailing list