[Bioperl-l] WGS, WGS_SCAFLD support added for GenBank files
Chris Fields
cjfields at uiuc.edu
Thu Mar 9 13:08:16 EST 2006
Added WGS and WGS_SCAFLD support to Bio::SeqIO::genbank as well as tests and
WGS sample file; the previous fix missed the WGS_SCAFLD line. I will also
soon add support to Bio::DB::GenBank for downloading WGS and WGS_SCAFLD
subfiles.
Brian, I found a pretty decent speed improvement for contig building in
Bio::DB::NCBIHelper; it basically fetches the contig whole from NCBI using
return type of 'gbwithparts' so the work is done on their end and just
switches the CONTIG line with the sequence; it took about 10 seconds vs. ~50
seconds using an unmodified NCBIHelper on my PC. I haven't committed it yet
bc I noticed the resulting contig files differ; the bioperl contig build
lacks any N's from the 'gaps()' in the CONTIG line while NCBI's version has
the N filler. I didn't know if the difference was a bug or not. Should I
go ahead and commit?
Christopher Fields
Postdoctoral Researcher - Switzer Lab
Dept. of Biochemistry
University of Illinois Urbana-Champaign
More information about the Bioperl-l
mailing list