Generate Histogram Distribution
For a quick exploration of the distribution
Exporting the data to Python is cumbersome. Instead, generate the frequency distribution directly in SQL.
Here's how:
- Use FLOOR to assign each value to a bucket
- Fill in empty buckets, which would otherwise be missing from the result, with GENERATE_SERIES
- Combine steps 1 and 2
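
The three steps above can be sketched as a single PostgreSQL query. This is only an illustration under stated assumptions, not a definitive implementation: the table orders, its numeric column amount, the bucket width of 10, and the 0 to 100 value range are all hypothetical, so swap in your own.

```sql
-- Hypothetical table and column: orders(amount).
-- Assumed bucket width: 10. Assumed value range: 0 up to (not including) 100.

-- Step 1: assign each row to a bucket with FLOOR, then count rows per bucket.
-- Dividing by 10.0 (not 10) avoids integer division on integer columns.
WITH counts AS (
    SELECT FLOOR(amount / 10.0) * 10 AS bucket_start,
           COUNT(*)                  AS frequency
    FROM orders
    GROUP BY 1
),
-- Step 2: list every bucket in the range, including buckets with no rows.
buckets AS (
    SELECT bucket_start
    FROM GENERATE_SERIES(0, 90, 10) AS g(bucket_start)
)
-- Step 3: left join so that empty buckets show up with a frequency of 0.
SELECT b.bucket_start,
       b.bucket_start + 10      AS bucket_end,
       COALESCE(c.frequency, 0) AS frequency
FROM buckets b
LEFT JOIN counts c ON c.bucket_start = b.bucket_start
ORDER BY b.bucket_start;
```

Because the bucket range is hard-coded in GENERATE_SERIES, any value outside 0 to 100 is silently dropped by the join, so check the column's minimum and maximum first.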
Some caveats:
- Bucketing is done manually here, in a rather hacky way: you pick the bucket width yourself. Python's automatic bucketing is superior.
- GENERATE_SERIES is a PostgreSQL function. Equivalents are also available in T-SQL, Redshift, and BigQuery.
- Some dashboarding tools (e.g., Metabase) have a built-in histogram visualization. Use that instead if your tool supports it.