ngram
Fast n-Gram 'Tokenization'
An n-gram is a sequence of n "words" taken, in order, from a body of text. This is a collection of utilities for creating, displaying, summarizing, and "babbling" n-grams. The 'tokenization' and "babbling" are handled by very efficient C code, which can even be built as its own standalone library. The babbler is a simple Markov chain. The package also offers a vignette with complete example 'workflows' and information about the utilities offered in the package.
- Version3.2.3
- R versionunknown
- LicenseBSD 2-clause License
- LicenseLICENSE
- Needs compilation?Yes
- ngram citation info
- Last release12/10/2023
Documentation
Team
Drew Schmidt
Christian Heckendorf
Show author detailsRolesAuthor
Insights
Last 30 days
Last 365 days
The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.
Data provided by CRAN
Binaries
Dependencies
- Reverse Imports5 packages
- Reverse Suggests1 package