ufal / udpipe
UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files
☆375Updated 3 months ago
Alternatives and similar repositories for udpipe:
Users that are interested in udpipe are comparing it to the libraries listed below
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆314Updated last week
- Various utilities for processing the data.☆208Updated this week
- spaCy + UDPipe☆161Updated 2 years ago
- English data☆205Updated this week
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆731Updated 6 months ago
- Universal Dependencies online documentation☆281Updated this week
- A minimal, pure Python library to interface with CoNLL-U format files.☆148Updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- Bitextor generates translation memories from multilingual websites☆292Updated 3 months ago
- Text tokenization and sentence segmentation (segtok v2)☆201Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆253Updated 6 months ago
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 2 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆221Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 9 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆139Updated 2 months ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆244Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- NER, syntax markup visualizations☆138Updated last year
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆347Updated 2 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts☆137Updated 2 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆72Updated last year
- Quickly extract multi-word phrases from a corpus☆190Updated 4 years ago
- Anafora is a web-based raw text annotation tool☆241Updated 2 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Sentence aligner☆110Updated 3 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆126Updated 2 months ago
- Making sense embedding out of word embeddings using graph-based word sense induction☆212Updated 3 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆56Updated last week
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 3 years ago