udpipe
Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Besides text parsing, the package also lets you train annotation models on 'treebanks' data in 'CoNLL-U' format as provided at
- Version: 0.8.11
- R version: ≥ 2.10
- License: MPL-2.0
- Needs compilation? Yes
- Last release: 01/06/2023
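The 'CoNLL-U' format named in the description is a plain-text layout with one token per line and ten tab-separated columns, with blank lines between sentences and `#` for comment lines. As a rough illustration of what annotated text looks like in that format, here is a minimal Python sketch that reads it. This is not the package's R API: the function name `parse_conllu`, the field names in `CONLLU_FIELDS` and the sample sentence are all assumptions made for this example, and the sample is hand-written, not output produced by udpipe.

```python
from typing import Dict, List

# The ten tab-separated columns of a CoNLL-U token line, in order.
# The lowercase names are this sketch's own labels for them.
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]


def parse_conllu(text: str) -> List[List[Dict[str, str]]]:
    """Split CoNLL-U text into sentences, each a list of token dicts."""
    sentences: List[List[Dict[str, str]]] = []
    current: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            # Comment lines carry sentence-level metadata; skipped here.
            continue
        columns = line.split("\t")
        if len(columns) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(columns)}: {line!r}")
        current.append(dict(zip(CONLLU_FIELDS, columns)))
    if current:
        # Input may end without a trailing blank line.
        sentences.append(current)
    return sentences


# Hand-written sample sentence (illustrative only, not udpipe output).
SAMPLE = "\n".join([
    "# text = Dogs bark.",
    "1\tDogs\tdog\tNOUN\tNNS\tNumber=Plur\t2\tnsubj\t_\t_",
    "2\tbark\tbark\tVERB\tVBP\t_\t0\troot\t_\tSpaceAfter=No",
    "3\t.\t.\tPUNCT\t.\t_\t2\tpunct\t_\t_",
    "",
])

sentences = parse_conllu(SAMPLE)
for token in sentences[0]:
    # Show the columns for tokenization, lemma, POS tag and dependency.
    print(token["form"], token["lemma"], token["upos"],
          token["head"], token["deprel"])
```

Each token row carries the results of all four annotation steps at once: the `form` column reflects tokenization, `lemma` lemmatization, `upos`/`xpos` the part-of-speech tags, and `head`/`deprel` the dependency parse. The sketch keeps multiword-token and empty-node lines as ordinary rows and does not interpret their IDs.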
Documentation
- Vignette: UDPipe Natural Language Processing - Annotating text
- Vignette: UDPipe Natural Language Processing - Parallel
- Vignette: UDPipe Natural Language Processing - Model Building
- Vignette: UDPipe Natural Language Processing - Try it out
- Vignette: UDPipe Natural Language Processing - Universe
- Vignette: UDPipe Natural Language Processing - Basic Analytical Use Cases
- Vignette: UDPipe Natural Language Processing - Topic Modelling Use Cases
- Material: README
- Material: NEWS
- In Views: NaturalLanguageProcessing
Team
- Jan Wijffels
- BNOSAC (Copyright holder)
- Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic (Copyright holder)
- Milan Straka (Contributor, Copyright holder)
- Jana Straková (Contributor, Copyright holder)
Dependencies
- Depends: 1 package
- Imports: 5 packages
- Suggests: 5 packages
- Linking To: 1 package
- Reverse Imports: 6 packages
- Reverse Suggests: 12 packages
- Reverse Enhances: 1 package