[Bioperl-l] Next-gen modules

Sendu Bala bix at sendu.me.uk
Thu Jun 18 02:45:17 EDT 2009


Chris Fields wrote:
> On Jun 17, 2009, at 5:10 PM, Sendu Bala wrote:
 >
>>> I'm personally wondering if this could be done as a sequence 
>>> database, something similar in theme to Lincoln's SeqFeature::Store, 
>>> but sequence only, and returns quality objects in a similar manner 
>>> (ala Storable)?  Not sure whether that's feasible, but it's appears 
>>> at least scalable.
>>
>> I think not. Well, at least SeqFeature::Store doesn't scale. Try 
>> storing millions of features in a database and watch it crawl to 
>> complete unusability. I can't imagine a db scaling to holding hundreds 
>> of TB of data either. I'm also not sure what the benefit is. There are 
>> already high-speed ways of indexing your fastq or bam files.
> 
> Interesting that you ran into issues with SF::Store; wonder if object 
> storage is the limiting factor there, or if it is something else.

Object storage certainly was an issue, which is why I patched it to 
(optionally) not store objects. That helped a great deal, but ultimately 
only increased the number of features you could store before it slowed 
down; it didn't solve the problem completely.


More information about the Bioperl-l mailing list