[Bioperl-l] no results with standalone tblastn
Hubert Prielinger
hubert.prielinger at gmx.at
Mon Feb 6 18:16:01 EST 2006
dear torsten,
I have downloaded all the databases, as you recommended me. And it is
working, but I don't get any results, if I try it online it works fine.
my result file looks like that:
TBLASTN 2.2.13 [Nov-27-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query=
(8 letters)
Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS,
GSS,environmental samples or phase 0, 1 or 2 HTGS sequences)
3,749,503 sequences; 16,556,997,203 total letters
Searching..................................................done
Sequences producing significant alignments: Score
E (bits) Value
the program code for it looks like that:
#!/usr/local/bin/perl -w
BEGIN
{
$ENV{BLASTDIR}= "/home/Hubert/blast/blast-2.2.13/bin";
$ENV{BLASTDATADIR}= "/home/Hubert/blast/blast-2.2.13/data";
}
use Bio::Tools::Run::StandAloneBlast;
use Bio::Seq;
use Bio::SeqIO;
use strict;
print "Please insert matrix:\t";
my $matrix_STD = <STDIN>;
chomp $matrix_STD;
print "Please insert count:\t";
my $count_STD = <STDIN>;
chomp $count_STD;
# parameters
my $expect_value = 20000;
#my $filter_query_sequence = 'T';
my $one_line_description = 1000;
my $alignments = 1000;
#my $matrix = 'BLOSUM80';
my $gapcost = 10;
my $gapextend = 1;
my $wordsize = 2;
#my $compbasedStat = '1';
#my $count = 1;
# my $strands = 1;
my @params = ('program' => 'tblastn','database' => 'nt');
#my $progress_interval = 100;
my $seqio_obj = Bio::SeqIO->new(
-file => "aloneblosum62.txt",
-format => "raw",
);
# create factory object and set parameters
my $factory = Bio::Tools::Run::StandAloneBlast->new(@params);
print "submitted parameters successfully \n";
$factory->e($expect_value);
#$factory->F($filter_query_sequence);
$factory->v($one_line_description);
$factory->b($alignments);
$factory->M($matrix_STD);
$factory->G($gapcost);
$factory->E($gapextend);
$factory->W($wordsize);
#$factory->C($compbasedStat);
#$factory->S($strands);
print "changed parameters successfully \n";
print "\n";
# get query
while ( my $query = $seqio_obj->next_seq) {
print "entered while loop \n";
my $blast_report = $factory->blastall($query);
# print "$blast_report\n";
$factory->outfile("nucleo80$count_STD.txt");
$count_STD++;
print $query->seq;
print "\n";
}
thanks
Hubert
Torsten Seemann wrote:
>Hubert
>
>
>
>>thanks for your quick reply, I have looked up at the ftp server and
>>there are nt.00 to nt.04. Do I have to download all of them, are there
>>differences?
>>
>>
>
>You have to download them all. The "nt" database (actually the index
>files) is very big, and it is split up into gigabyte (?) parts. Although
>they are called "nt.00" "nt.01" etc, you still pass "-d nt" to
>"blastall", because together these parts are one "nt" database. The
>"blastall" program will automatically use the separate parts; you do not
>have to join them.
>
>You should read http://www.ncbi.nlm.nih.gov/BLAST/ to make sure you are
>using the correct BLAST search for your problem.
>
>
>
More information about the Bioperl-l
mailing list