[Bioperl-l] About binning data for histograms

Abhishek Pratap abhishek.vit at gmail.com
Sun Aug 16 17:34:54 UTC 2009


Thanks All. I completely forgot and dint realize that histogram
function in R could auto bin based on the data.

Cheers,
-Abhi

On Sun, Aug 16, 2009 at 1:25 PM, Jason Stajich<jason at bioperl.org> wrote:
> For binning of a distribution see the perl module Statistics::Descriptive -
> http://search.cpan.org/~colink/Statistics-Descriptive-2.6/Descriptive.pm function:
> frequency_distritibution
>
> I would also look at R histogram function for the plotting.  This would be
> one of the easiest ways - I would just make a perl script that generates the
> correct R code that can be used to make the plots.
>
>
> On Aug 16, 2009, at 4:06 AM, Abhishek Pratap wrote:
>
>> Hi All
>>
>> After a lot of look up on forums I could google, I am finally posting
>> my question here. I think it may not be appropriate for this mailing
>> list. I apologize for this first up. The question is regarding dynamic
>> binning of data points for histogram plots.
>>
>> So I have many hashes, each having a "numerical" coverage data
>> obtained from Next generation sequencing data analysis. Now each hash
>> may have couple of hundred to thousands entry "contig_name =>
>> coverage".  What I want to do is to plot a histogram for each
>> hash/dataset.  "Coverage v/s Count of contigs with coverage > #N " ( N
>> has to be binned according to the data size).
>>
>> I am using Chart::Gnuplot for this but I am not able to figure out how
>> to bin the data points to fit nicely on a screen. Is there any
>> smart/quick method to do this.
>>
>> Any pointers will help a great deal.
>>
>> Best Regards,
>> -Abhi
>> _______________________________________________
>> Bioperl-l mailing list
>> Bioperl-l at lists.open-bio.org
>> http://lists.open-bio.org/mailman/listinfo/bioperl-l
>
> --
> Jason Stajich
> jason.stajich at gmail.com
> jason at bioperl.org
>
>




More information about the Bioperl-l mailing list