[Bioperl-l] About binning data for histograms

Sean Davis sdavis2 at mail.nih.gov
Sun Aug 16 11:48:23 UTC 2009


On Sun, Aug 16, 2009 at 4:06 AM, Abhishek Pratap <abhishek.vit at gmail.com>wrote:

> Hi All
>
> After a lot of look up on forums I could google, I am finally posting
> my question here. I think it may not be appropriate for this mailing
> list. I apologize for this first up. The question is regarding dynamic
> binning of data points for histogram plots.
>
> So I have many hashes, each having a "numerical" coverage data
> obtained from Next generation sequencing data analysis. Now each hash
> may have couple of hundred to thousands entry "contig_name =>
> coverage".  What I want to do is to plot a histogram for each
> hash/dataset.  "Coverage v/s Count of contigs with coverage > #N " ( N
> has to be binned according to the data size).
>
> I am using Chart::Gnuplot for this but I am not able to figure out how
> to bin the data points to fit nicely on a screen. Is there any
> smart/quick method to do this.
>
> Any pointers will help a great deal.
>

Hi, Abhi.

You could use R, but you got that already.  ; )

However, you might look here for a perl solution.

http://search.cpan.org/~whizdog/GDGraph-histogram-1.1/lib/GD/Graph/histogram.pm

Sean



More information about the Bioperl-l mailing list