[BioRuby] Rakefiles for the Dindel workflow

Raoul Bonnal bonnalraoul at ingm.it
Wed Feb 2 09:41:47 UTC 2011


Hi Hiro,
seems an interesting approach, we could discuss this e-mail tomorrow during the irc meeting.
is it fine  for you ?

I can't catch how do you use pwrake, sorry probably I missed something, I mean the benefit.


On 02/feb/2011, at 04.51, MISHIMA, Hiroyuki wrote:

> Hi all,
> 
> I wrote a rakefile for the Dindel workflow. Dindel is a toolkit to call
> small indels from mapped short-read data (BAM files). See
> http://www.sanger.ac.uk/resources/software/dindel/ .
> 
> The rakefile is available at https://github.com/misshie/RakefileDindel .
> 
> I think my rakefile is a good example of "dynamic task definition".
> During Stage 3 of the workflow, Dindel generates over 300 files for
> exome data. Although the file naming rule is known before running rake,
> the number of files to be generated is unknown at the time.
> 
> van der Aalst et al. (2003) have shown this pattern as "pattern 14:
> multiple instances with a priori runtime knowledge". Rakefiles can
> describe this pattern using Rake::Task#invoke.
> 
> Furthermore, this rakefile demonstrates effectiveness of Pwrake. Stage 3
> is a typical embarrassingly parallel problem.
> 
> To improve rakefile readability, I separated a Rakefile into Rakefile,
> Rakefile.invoke, and Rakefile.helper. Rakefile is workflow description.
> Rakefile.invoke is command-lines to invoke tools. Rakefile.helper has
> helper methods making Rakefile simpler.
> 
> Previously Yannick Wurm has shown a rakefile "cdsToAlignmentToTree" at
> https://github.com/yannickwurm/tidbits/ . The rakefile handles
> exceptions carefully. Mine does not. Yannick's approach is important
> because sometimes error messages in rakefiles are not intuitive.
> 
> So far, my workflow does not use BioRuby at all. Raoul Bonnal have
> suggested BioRuby-rake integration replying Yannick's post. Introducing
> modular task definition to rake is what we need but not very easy
> because each workflow step in a rakefiles is linked too tight.
> 
> Introducing a BioRuby plug-in to support common helper methods
> simplifying rakefiles seems easier. It may contain my Rakefile.helper,
> and Yannick's helper methods and exception handling.
> 
> I will try to write a small plug-in for a while.
> 
> Sincerely yours,
> Hiro.
> -- 
> MISHIMA, Hiroyuki, DDS, Ph.D.
> COE Research Fellow
> Department of Human Genetics
> Nagasaki University Graduate School of Biomedical Sciences
> _______________________________________________
> BioRuby Project - http://www.bioruby.org/
> BioRuby mailing list
> BioRuby at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioruby

--
R.J.P.B.








More information about the BioRuby mailing list