datasauRus
Datasets from the Datasaurus Dozen
The Datasaurus Dozen is a set of datasets with the same summary statistics. They retain the same summary statistics despite having radically different distributions. The datasets represent a larger and quirkier object lesson that is typically taught via Anscombe's Quartet (available in the 'datasets' package). Anscombe's Quartet contains four very different distributions with the same summary statistics and as such highlights the value of visualisation in understanding data, over and above summary statistics. As well as being an engaging variant on the Quartet, the data is generated in a novel way. The simulated annealing process used to derive datasets from the original Datasaurus is detailed in "Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing" doi:10.1145/3025453.3025912.
- GitHub
- https://jumpingrivers.github.io/datasauRus/
- File a bug report
- datasauRus results
- datasauRus.pdf
- Version0.1.9
- R versionR (≥ 3.5.0)
- LicenseMIT
- Needs compilation?No
- Languageen-US
- Last release01/23/2025
Documentation
Team
Colin Gillespie
MaintainerShow author detailsAlberto Cairo
Show author detailsRolesdtcRichard Cotton
Show author detailsRolesContributorGeorge Fitzmaurice
Show author detailsRolesdtcSteph Locke
Show author detailsRolesAuthorJustin Matejka
Show author detailsRolesdtcLucy D'Agostino McGowan
Show author detailsRolesAuthorJumping Rivers
Rhian Davies
Show author detailsRolesAuthorTim Book
Show author detailsRolesContributor
Insights
Last 30 days
The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.
Last 365 days
The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.
Data provided by CRAN
Binaries
Dependencies
- Suggests6 packages
- Reverse Suggests1 package