git2rdata
Store and Retrieve Data.frames in a Git Repository
The git2rdata package is an R package for writing and reading dataframes as plain text files. A metadata file stores important information. 1) Storing metadata allows to maintain the classes of variables. By default, git2rdata optimizes the data for file storage. The optimization is most effective on data containing factors. The optimization makes the data less human readable. The user can turn this off when they prefer a human readable format over smaller files. Details on the implementation are available in vignette("plain_text", package = "git2rdata"). 2) Storing metadata also allows smaller row based diffs between two consecutive commits. This is a useful feature when storing data as plain text files under version control. Details on this part of the implementation are available in vignette("version_control", package = "git2rdata"). Although we envisioned git2rdata with a git workflow in mind, you can use it in combination with other version control systems like subversion or mercurial. 3) git2rdata is a useful tool in a reproducible and traceable workflow. vignette("workflow", package = "git2rdata") gives a toy example. 4) vignette("efficiency", package = "git2rdata") provides some insight into the efficiency of file storage, git repository size and speed for writing and reading.
- https://ropensci.github.io/git2rdata/
- GitHub
- https://doi.org/10.5281/zenodo.1485309
- File a bug report
- git2rdata results
- git2rdata.pdf
- Version0.4.1
- R version≥ 3.5.0
- LicenseGPL-3
- Needs compilation?No
- Languageen-GB
- git2rdata citation info
- Last release09/06/2024
Documentation
- VignetteEfficiency Relative to Storage and Time
- Vignettesource
- VignetteR code
- VignetteAdding metadata
- Vignettesource
- VignetteR code
- VignetteGetting Started Storing Dataframes as Plain Text
- Vignettesource
- VignetteR code
- VignetteStoring Large Dataframes
- Vignettesource
- VignetteR code
- VignetteOptimizing Storage for Version Control
- Vignettesource
- VignetteR code
- VignetteSuggested Workflow for Storing a Variable Set of Dataframes under Version Control
- Vignettesource
- VignetteR code
- MaterialREADME
- MaterialNEWS
Team
Thierry Onkelinx
Floris Vanderhaeghe
Peter Desmet
Els Lommelen
Research Institute for Nature and Forest
Show author detailsRolesCopyright holder, fnd
Insights
Last 30 days
Last 365 days
The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.
Data provided by CRAN
Binaries
Dependencies
- Depends1 package
- Imports3 packages
- Suggests5 packages