ufal / udpipe
UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files
☆379, updated 4 months ago
Alternatives and similar repositories for udpipe:
Users who are interested in udpipe are comparing it to the libraries listed below:
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. (☆315, updated last month)
- spaCy + UDPipe (☆161, updated 2 years ago)
- Use the latest Stanza (StanfordNLP) research models directly in spaCy (☆731, updated 7 months ago)
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology… (☆222, updated 2 years ago)
- Various utilities for processing the data. (☆208, updated this week)
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface (☆254, updated 7 months ago)
- Universal Dependencies online documentation (☆282, updated this week)
- A minimal, pure Python library to interface with CoNLL-U format files. (☆149, updated last year)
- Text tokenization and sentence segmentation (segtok v2) (☆202, updated 3 years ago)
- English data (☆206, updated last week)
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more… (☆112, updated 11 months ago)
- Work continues on INCEpTION (https://github.com/inception-project/inception). The official WebAnno repository has reached the… (☆245, updated 2 years ago)
- A sentence segmenter that actually works! (☆305, updated 4 years ago)
- Machine-Translation-based sentence alignment tool for parallel text (☆308, updated 4 years ago)
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation (☆191, updated 4 years ago)
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. (☆56, updated this week)
- Sentence aligner (☆112, updated 3 years ago)
- Named Entity Recognition data for Europeana Newspapers (☆171, updated 2 years ago)
- Quickly extract multi-word phrases from a corpus (☆191, updated 4 years ago)
- Bitextor generates translation memories from multilingual websites (☆292, updated 4 months ago)
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) (☆361, updated last year)
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities. (☆352, updated 2 years ago)
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics (☆210, updated last year)
- Improved Sentence Alignment in Linear Time and Space (☆169, updated 2 years ago)
- Efficient Low-Memory Aligner (☆143, updated 2 months ago)
- Language detection extension for spaCy 2.0+ (☆112, updated 6 years ago)
- Lexical database for ~70k English words with morphological variables (☆42, updated 3 years ago)
- German Morphological Analyzer (☆47, updated 3 years ago)
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018) (☆158, updated 5 years ago)
- A modern, interlingual wordnet interface for Python (☆237, updated last week)