Question 1

What does the R-package 'udpipe' do?

Accepted Answer

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit. This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at [https://universaldependencies.org/format.html](https://universaldependencies.org/format.html). The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at [doi:10.18653/v1/K17-3009](doi:10.18653/v1/K17-3009). The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser. Namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns) sentiment scoring and semantic similarity analysis.

Question 2

Who maintains udpipe?

Accepted Answer

Jan Wijffels

Question 3

Who authored udpipe?

Accepted Answer

BNOSAC, Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic, Milan Straka, Jana Straková

Question 4

What is the current version of udpipe?

Accepted Answer

The current version of the R-package '0.8.11' is 0.8.11

Question 5

When was the last release of udpipe?

Accepted Answer

The last release of the R-package '0.8.11' was 01/06/2023

Question 6

Where can I search for the R-package 'udpipe'?

Accepted Answer

You can search for the R-package 'udpipe' on CRAN/E at https://cran-e.com

udpipe

Documentation

Team

Jan Wijffels

BNOSAC

Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic

Milan Straka

Jana Straková

Insights

Last 30 days

Last 365 days

Binaries

Dependencies