ufal / udpipe
UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files
☆364Updated last week
Related projects ⓘ
Alternatives and complementary repositories for udpipe
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆312Updated last month
- spaCy + UDPipe☆161Updated 2 years ago
- Various utilities for processing the data.☆207Updated this week
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆249Updated 2 months ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆725Updated 3 months ago
- English data☆201Updated this week
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics☆208Updated 11 months ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆185Updated 4 years ago
- Universal Dependencies online documentation☆273Updated this week
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 6 months ago
- Bitextor generates translation memories from multilingual websites☆291Updated last week
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆220Updated last year
- A minimal, pure Python library to interface with CoNLL-U format files.☆149Updated last year
- Named Entity Recognition based on dictionaries☆242Updated 5 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆54Updated 2 weeks ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University.☆343Updated last year
- Named Entity Recognition data for Europeana Newspapers☆173Updated last year
- Lexicon of frame files used by Propbank annotation. A searchable, readable version of the latest release is here: http://propbank.github…☆99Updated this week
- Making sense embedding out of word embeddings using graph-based word sense induction☆212Updated 3 years ago
- Anafora is a web-based raw text annotation tool☆241Updated 2 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)☆158Updated 5 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆217Updated 4 months ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆245Updated last year
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆61Updated this week
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆343Updated last year
- The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts☆136Updated 2 years ago
- Python framework for processing Universal Dependencies data☆57Updated this week