piecemaker
Tools for Preparing Text for Tokenizers
Tokenizers break text into pieces that are more usable by machine learning models. Many tokenizers share some preparation steps. This package provides those shared steps, along with a simple tokenizer.
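As a rough illustration of what such shared preparation steps can look like, here is a minimal Python sketch. It is not piecemaker's API or implementation. The function names `prepare` and `simple_tokenize` are hypothetical, and the particular steps (Unicode normalization, dropping control characters, padding punctuation with spaces, collapsing whitespace) are assumptions about what a typical pipeline includes.

```python
import re
import unicodedata


def prepare(text: str) -> str:
    """Hypothetical helper: generic clean-up before tokenizing."""
    # Normalize to a composed Unicode form so equivalent strings compare equal.
    text = unicodedata.normalize("NFC", text)
    # Drop control characters (category "Cc") but keep whitespace such as tabs.
    text = "".join(
        ch for ch in text
        if ch.isspace() or unicodedata.category(ch) != "Cc"
    )
    # Surround each punctuation or symbol character with spaces so it
    # becomes its own piece when the text is split on whitespace.
    text = re.sub(r"([^\w\s])", r" \1 ", text)
    # Collapse runs of whitespace to single spaces and trim the ends.
    return re.sub(r"\s+", " ", text).strip()


def simple_tokenize(text: str) -> list:
    """Hypothetical helper: prepare the text, then split on whitespace."""
    return prepare(text).split()


if __name__ == "__main__":
    print(simple_tokenize("Hello,\tworld!\x00  Bye."))
```

Splitting on whitespace only after the clean-up steps keeps the tokenizer itself trivial. Most of the work, and most of what different tokenizers have in common, sits in the preparation function.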
- https://macmillancontentscience.github.io/piecemaker/
- Version: 1.0.2
- R version: ≥ 2.10
- License: Apache License (≥ 2)
- Needs compilation? No
- Last release: 06/02/2023
Team
Jon Harmon
Jonathan Bratt
Bedford Freeman & Worth Pub Grp LLC DBA Macmillan Learning (copyright holder)
Dependencies
- Depends: 1 package
- Imports: 5 packages
- Suggests: 2 packages
- Reverse Imports: 2 packages