openmolecules.org

 
Home » DataWarrior » Functionality » maximum number of bins/categories for bar charts
maximum number of bins/categories for bar charts [message #1801] Sat, 07 January 2023 05:10 Go to next message
chemtv is currently offline  chemtv
Messages: 26
Registered: February 2015
Location: Indianapolis
Junior Member
Hello,
I have a large dataset with near 5000 bins/categories that I would like to plot as a bar chart/histogram. Since I can do this with small example datasets I assume I've exceeded some maximum number of bins/categories allowed. What is the limit for bar charts? Is this the same limit as for category variables? Any way I should be able to figure this out without a lot of trial and error?

Thanks,
Greg
Re: maximum number of bins/categories for bar charts [message #1802 is a reply to message #1801] Sat, 07 January 2023 20:45 Go to previous messageGo to next message
nbehrnd is currently offline  nbehrnd
Messages: 224
Registered: June 2019
Senior Member
Dear Greg,

I think the maximum number of bins the bar chart is capable to display equates to 100,* which is less than the binning algorithm is capable to generate and simultaneously, which is less than the interactive preview (Data -> Add Bins from Numbers) apparently can visualize when adjusting the slide rules.

This suggests the addition of a display, or/and a dial about the number of bins created; a bit similar to what the generation of histograms with e.g. Origin implements.[1] Perhaps complemented with a note to the user («Danger, Will Robinson») if the number of bins computed reached, or exceeds the critical threshold of bins the bar chart still is capable to visualize.

Norwid

[1] https://www.originlab.com/doc/en/Origin-Help/PD-Dialog-Data- Tab

* I had a 5k set at hand, and iteratively progressed by division by 2, rounded to integers somewhat easier to recall. By this approach, 100 bins passed, and 150 bins were too many.


Re: maximum number of bins/categories for bar charts [message #1810 is a reply to message #1802] Wed, 18 January 2023 15:16 Go to previous messageGo to next message
thomas is currently offline  thomas
Messages: 715
Registered: June 2014
Senior Member
Dear Greg and Norwid,

currently, the maximum number of categories on one axis is 256.
The max number of categories on all axes (product of individual axis category counts) is limited to 32768.
In addition one can sub-split categories on any axis with case separation into up to 128 sub-categories.
Then views can be split by 1 or 2 category columns into a maximum of 10000 views.

The limits are there to avoid an explosion of memory usage considering that a higher number of categories usually can not be displayed reasonably well anyway.

Thomas
Re: maximum number of bins/categories for bar charts [message #1812 is a reply to message #1810] Thu, 19 January 2023 22:51 Go to previous message
nbehrnd is currently offline  nbehrnd
Messages: 224
Registered: June 2019
Senior Member
Dear Thomas,

though my test did assigned a maximum threshold of bins below the one set in the program, I still like to retain it were a nice addition if the interactive binning (by move of the sliders) either

a) there were an indicator of the number of bins are going to be created, or/and
b) an optional dial to divide a set into exactly 12, 36, 100, or an other number of bins at the discretion of the user

in the small temporally displayed window (cf. the silent attached) before the eventual click on «ok». In comparison to the adjustments of representations of the 2D/3D plots, the binning is less reversible; once applied to the data in the table, it stays there until an other binning is applied.

With regards,

Norwid
  • Attachment: binning.mp4
    (Size: 286.06KB, Downloaded 210 times)
Previous Topic: ChemAxon plugin
Next Topic: How to add the "Structure of smile similarity" score to the colume
Goto Forum:
  


Current Time: Thu Nov 21 17:37:46 CET 2024

Total time taken to generate the page: 0.02964 seconds