[Bioperl-pipeline] Re: bioperl-pipeline for the small lab

06 Aug 2002 09:41:01 +0800

Hi Mat,
	great to hear from you. We are most interested to see how we can help
you out. We want to encourage as much use of the pipeline as possible
and that will give us tremendous support in terms of validating our
design and incorporating new features as new requirements arise.
>From your mail, I will try and summarize what you propose

1) The daemon pipeline which I can see as a cron job is essentially a
one stage pipeline. But we might have one or more runnables for the
logic of doing the diffs and annotating your database based on new hits
 etc. Yup, its interesting to me to develop this functionality which is
very generalizable.

2) The second pipeline is seems like a series of blast pipelines running
in parallel. Then using the hits to run framesearch and TFASTA. We can
write bioperl wrapper for those or you could help out too :) Wait for
the proposal of a new Bio::Tools::Run::Analysis interfaceI think that
Martin Senger is proposing for writing these wrappers which should be
quite nice and clean  . Outputs can be dumped to database or csv as you
wish :) In terms of GFD, it seems that we are mostly storing
hits/featurepairs for your searches which is now doable and we are
refining. You prolly also want to store your genes in the db as well.
For your sequences you may want to store them in BioSQL.

When I get back, we can come up with more concrete plans, I will try and
write up some configurations for your pipeline :)

its looks like an interesting start. FYI, we just created a
bioperl-pipeline mailing list and hope u don't mind I have cc'ed this 
mail to the list. 

>I can write some sort of docs for you on how to do this for the small
lab? 
:) we will definitely take u up on that

are u still at ISMB? lets talk some more if you want 
great mail.

cheers,
shawn