tok

Fast Text Tokenization

CRAN Package

Interfaces with the 'Hugging Face' tokenizers library to provide implementations of today's most used tokenizers such as the 'Byte-Pair Encoding' algorithm . It's extremely fast for both training new vocabularies and tokenizing texts.


Documentation


Team


Insights

Last 30 days

Last 365 days

The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.

Data provided by CRAN


Binaries


Dependencies

  • Depends1 package
  • Imports2 packages
  • Suggests4 packages