UniversalDependencies / docsLinks
Universal Dependencies online documentation
☆285Updated this week
Alternatives and similar repositories for docs
Users that are interested in docs are comparing it to the libraries listed below
Sorting:
- Various utilities for processing the data.☆209Updated this week
- English data☆208Updated last week
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆316Updated last week
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆194Updated 4 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆56Updated 3 weeks ago
- Crawler for linguistic corpora☆204Updated last year
- GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package al…☆267Updated 2 years ago
- Python framework for processing Universal Dependencies data☆57Updated this week
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 3 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆380Updated 7 months ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆246Updated 2 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆223Updated 2 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆43Updated 7 months ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆734Updated 10 months ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts☆139Updated 2 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆164Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Bitextor generates translation memories from multilingual websites☆293Updated 7 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆259Updated 9 months ago
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆249Updated 9 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆309Updated 4 years ago
- This is a CoNLL formatted version of the OntoNotes 5.0 release.☆189Updated 10 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆112Updated 4 years ago
- The Arborator software is aimed at collaboratively annotating dependency corpora.☆26Updated 5 years ago
- ☆64Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆363Updated last year
- Automatically exported from code.google.com/p/berkeleyparser☆182Updated 4 years ago