[Bioperl-l] Recommended way to download qual files from Genbank?

Phillip San Miguel pmiguel at purdue.edu
Fri Jan 11 16:22:38 UTC 2008


No problem getting sequence from genbank via a myriad of methods. But as 
the volume of non-finished sequence in genbank increases the importance 
of also obtaining quality values for a given sequence increases. Some 
records include quality values.

I typically use bp_fetch.pl to grab a sequence from genbank:

bp_fetch.pl -fmt fasta net::genbank:AC207960

sends the fasta sequence to STDOUT. But that bp_fetch.pl wasn't designed 
to pull down quals evidently:

bp_fetch.pl -fmt qual net::genbank:AC207960

gives:

------------- EXCEPTION: Bio::Root::Exception -------------
MSG: You must pass a Bio::Seq::Quality or a Bio::Seq::PrimaryQual object 
to write_seq() as a parameter named "source"
STACK: Error::throw
STACK: Bio::Root::Root::throw 
/usr/local/perl_5.8/lib/site_perl/5.8.8/Bio/Root/Root.pm:359
STACK: Bio::SeqIO::qual::write_seq 
/usr/local/perl_5.8/lib/site_perl/5.8.8/Bio/SeqIO/qual.pm:205
STACK: /usr/local/perl/bin/bp_fetch.pl:313
-----------------------------------------------------------

(running under bioperl 1.5.2)

The quality values for this accession are in genbank as these URLs 
demonstrate:

http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&id=154937460

http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&list_uids=154937460&dopt=fasta

http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&list_uids=154937460&dopt=qual

What is the best way to pull down these qual values? They aren't present 
in "GenBank(Full)" format. They are present in an ASN.1 format.

Advice would be appreciated.

-- 
Phillip
Purdue Genomics Core Facility







More information about the Bioperl-l mailing list