planningML
A Sample Size Calculator for Machine Learning Applications in Healthcare
Advances in automated document classification has led to identifying massive numbers of clinical concepts from handwritten clinical notes. These high dimensional clinical concepts can serve as highly informative predictors in building classification algorithms for identifying patients with different clinical conditions, commonly referred to as patient phenotyping. However, from a planning perspective, it is critical to ensure that enough data is available for the phenotyping algorithm to obtain a desired classification performance. This challenge in sample size planning is further exacerbated by the high dimension of the feature space and the inherent imbalance of the response class. Currently available sample size planning methods can be categorized into: (i) model-based approaches that predict the sample size required for achieving a desired accuracy using a linear machine learning classifier and (ii) learning curve-based approaches (Figueroa et al. (2012)
- Version1.0.1
- R version≥ 3.5.0
- LicenseGPL-2
- Needs compilation?No
- Last release06/23/2023
Documentation
Team
Xinying Fang
Satabdi Saha
Show author detailsRolesAuthorJaejoon Song
Show author detailsRolesAuthorSai Dharmarajan
Show author detailsRolesAuthor
Insights
Last 30 days
Last 365 days
The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.
Data provided by CRAN
Binaries
Dependencies
- Depends1 package
- Imports8 packages
- Suggests2 packages