[BioRuby] Beautiful code for Bioinformatics

Raoul Bonnal bonnal at ingm.org
Mon Feb 13 05:46:26 EST 2012


Yeah, I was thinking the exact same thing.

This kind of documentation will be of great help!!! I'm thinking about
teaching Ruby @ Uni.

What I missed from the previous email is:
Beautiful code come from coders but that is not enough, working with people
in this ML I found that the quality of code and its beauty increase only if
you chat/talk with people and you are available to accept critics and
contributes. 

Another tab ? ;-)


On 13/02/12 11.34, "Pjotr Prins" <pjotr.public14 at thebird.nl> wrote:

> Excellent. 
> 
> May I propose that at least the other half of the authors of the
> Biogem paper send in an example of beautiful code? We can compile
> these snippets on a special page on one of the websites (or both).
> That will help courses, 'newbies' and even 'die hard' Perl users, such
> as Yoda, to see what this Ruby thing is about.
> 
> Maybe it is time for a 'Bioinformatics Ruby Academy'. Ruby rules. Herh
> herh herh.
> 
> Pj.
> 
> On Mon, Feb 13, 2012 at 11:06:03AM +0100, Francesco Strozzi wrote:
>> Some code I particularly liked comes from the BioNGS Wrapper library (by
>> Raoul). The gem is in development and I'm contributing to it so I could
>> look at the code from a very close perspective!
>> 
>> https://gist.github.com/1815501
>> 
>> What is interesting is the use of an internal DSL which makes the code
>> particularly expressive and readable, and writing a wrapper becomes only a
>> matter of defining the options that the command line binary expects. The
>> library is quite large and includes also methods to define aliases for the
>> options.
>> This module makes use also of the powerful Thor gem (used in Rails) to
>> create and define tasks that can be used to launch the binaries and to
>> include them in larger workflows and pipelines.
>> Now let's say you want to turn your freshly new wrapped binary into a
>> mighty Thor taks, all you need to do is:
>> 
>> https://gist.github.com/1815523
>> 
>> The amazing part is that the parameters passed to the block works like
>> arguments definition for the Thor task itself. The code that makes this
>> possible behind the scenes is here:
>> 
>> https://gist.github.com/1815481
>> 
>> It checks also if the number of arguments passed are equal to the arguments
>> of the block. If not, it raises an error to the user as he/she is calling
>> the task with the wrong number of arguments. To understand every part you
>> need to know a bit the way the Thor library defines tasks (more here:
>> https://github.com/wycats/thor) but the code here in BioNGS definitely
>> worth a look!
>> 
>> P.S. I'm diving into Scala too :-). I took the Odersky book few months ago
>> and now I'm about to start looking in details at this new programming
>> language. Easy and powerful parallelism is the new goal we need to achieve
>> to keep up with the big data era.
>> 
>> Cheers
>> 
>> On Mon, Feb 13, 2012 at 08:54, Pjotr Prins <pjotr.public14 at thebird.nl>wrote:
>> 
>>> OK, here another candidate for the price of beautiful code:
>>> 
>>> 
>>> https://github.com/trevor/bioruby-restriction_enzyme/blob/master/lib/bio/uti
>>> l/restriction_enzyme/range/sequence_range/calculated_cuts.rb
>>> 
>>> Trevor has implemented some hairy logic into the RE code. I mean
>>> hairy, that if it were done by someone else it would become spaghetti
>>> code (plenty of examples there in the real world!). You can see, that
>>> even when choosing sensible names, and explaining the code with good
>>> comments, it may still be hard to understand! But I think
>>> 
>>>  def add_cuts_from_cut_ranges(cut_ranges)
>>> 
>>> pretty much sums it up :). Still, it is beautiful, because it is hard
>>> to think of doing it better. The Ruby code is short and self
>>> explanatory and RE library has almost become a DSL for cutting
>>> sequences using restriction enzymes. That is beautiful.
>>> 
>>> Pj.
>>> 
>>> On Sat, Feb 11, 2012 at 11:27:44PM +0300, George Githinji wrote:
>>>> Hi All
>>>> Beauty is in the eyes of the beholder!
>>>> The Bio-Alignment plugin can  read and interconvert a nucleotide
>>>> alignment to an amino acid alignment.  I liked the simplicity of how
>>>> PJ has implemented the codon to amino acid conversion helper method
>>>> while taking care of the gaps or undefined aa translations.
>>>> 
>>>>       # lazily convert to Amino acid (once only)
>>>>       def to_aa
>>>>         aa = translate
>>>>         if not aa
>>>>           if gap?
>>>>             return '-'
>>>>           elsif undefined?
>>>>             return 'X'
>>>>           else
>>>>             raise 'What?'
>>>>           end
>>>>         end
>>>>         aa
>>>>       end
>>>> 
>>>> This method does not have any ruby 'magic' and is self documenting.
>>>> The gap? and undefined? methods are implemented as simple one line
>>>> standalone methods.
>>>> 
>>>> Again I like this simple 'trick' of getting an array of codons from a
>>>> sequence in the codonsequence class.
>>>> 
>>>> seq.scan(/\S\S\S/) #gets an array of codons
>>>> 
>>>> The longer alternative would be to create a bio::sequence::NA object
>>>> and iterate
>>>> seq = Bio::Sequence::NA.new("blahahahha")
>>>> seq.window_search(3, 3) do |subseq|
>>>>   puts subseq
>>>> end
>>>> 
>>>> It seems more intuitive to represent a sequence as an array of codon
>>>> objects. In this way the codons have some state and can carry
>>>> 'luggage'. getting the string representation of the sequence is as
>>>> simple as
>>>> def to_s
>>>>  @seq.map { |codon| codon.to_s }.join(' ')
>>>> end
>>>> 
>>>> To be more DRY, the to_nt method in the same class could be aliased
>>>> from the to_s method
>>>> 
>>>> It seems the bio-plugins are a rich source of tricks and great
>>> learning.... !
>>>> 
>>>> 
>>>> On Sat, Feb 11, 2012 at 10:08 PM, Peter Cock <p.j.a.cock at googlemail.com>
>>> wrote:
>>>>> On Sat, Feb 11, 2012 at 5:46 PM, Pjotr Prins <
>>> pjotr.public14 at thebird.nl> wrote:
>>>>>> Correct me if I am wrong, but has everyone moved across to BioPython
>>>>>> now? Or even to the dark side?
>>>>>> 
>>>>>> Pj.
>>>>> 
>>>>> I haven't noticed any BioRuby developers posting on the Biopython
>>>>> mailing lists recently - but you'd be welcome ;)
>>>>> 
>>>>> On a related topic, my first BioRuby pull request was merged, so
>>>>> there is a little direct cross project contribution going on :)
>>>>> 
>>>>> 
>>> https://github.com/bioruby/bioruby/commit/f33abf9bbd90c3c1e320f06447fdb54ffd
>>> 094c5d
>>>>> 
>>>>> Peter
>>>>> _______________________________________________
>>>>> BioRuby Project - http://www.bioruby.org/
>>>>> BioRuby mailing list
>>>>> BioRuby at lists.open-bio.org
>>>>> http://lists.open-bio.org/mailman/listinfo/bioruby
>>>> 
>>>> 
>>>> 
>>>> --
>>>> ---------------
>>>> Sincerely
>>>> George
>>>> Skype: george_g2
>>>> Blog: http://biorelated.wordpress.com/
>>>> Twitter: http://twitter.com/#!/george_l
>>>> 
>>> _______________________________________________
>>> BioRuby Project - http://www.bioruby.org/
>>> BioRuby mailing list
>>> BioRuby at lists.open-bio.org
>>> http://lists.open-bio.org/mailman/listinfo/bioruby
>>> 
>> 
>> 
>> 
>> -- 
>> 
>> Francesco
>> _______________________________________________
>> BioRuby Project - http://www.bioruby.org/
>> BioRuby mailing list
>> BioRuby at lists.open-bio.org
>> http://lists.open-bio.org/mailman/listinfo/bioruby
>> 
> _______________________________________________
> BioRuby Project - http://www.bioruby.org/
> BioRuby mailing list
> BioRuby at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioruby




More information about the BioRuby mailing list