Hyperparticle / udifyLinks

A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology tags, lemmas, and dependency trees.

☆223

Alternatives and similar repositories for udify

Users that are interested in udify are comparing it to the libraries listed below

Sorting:

robertostling / eflomal
Efficient Low-Memory Aligner
☆146Updated 6 months ago
feralvam / easse
Easier Automatic Sentence Simplification Evaluation
☆161Updated last year
EmilStenstrom / conllu
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
☆317Updated this week
getalp / disambiguate
Disambiguate is a tool for training and using state of the art neural WSD models
☆60Updated 3 weeks ago
TharinduDR / TransQuest
Transformer based translation quality estimation
☆112Updated 2 years ago
cisnlp / simalign
Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
☆373Updated last year
pyconll / pyconll
A minimal, pure Python library to interface with CoNLL-U format files.
☆151Updated 2 years ago
getalp / UFSAC
UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them
☆38Updated 3 years ago
artetxem / monoses
Unsupervised Statistical Machine Translation
☆229Updated 4 years ago
Helsinki-NLP / OpusFilter
OpusFilter - Parallel corpus processing toolkit
☆108Updated last month
lilt / alignment-scripts
Scripts to preprocess training and test data and to run fast_align and giza
☆108Updated 3 years ago
afshinrahimi / mmner
Massively Multilingual Transfer for NER
☆86Updated 3 years ago
SapienzaNLP / ewiser
A Word Sense Disambiguation system integrating implicit and explicit external knowledge.
☆69Updated 3 years ago
snukky / wikiedits
Automatic extraction of edited sentences from text edition histories.
☆83Updated 3 years ago
Unbabel / OpenKiwi
Open-Source Machine Translation Quality Estimation in PyTorch
☆232Updated 3 years ago
cocoxu / simplification
Text Simplification System and Dataset
☆122Updated 2 years ago
thompsonb / prism
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
☆101Updated last year
bitextor / bicleaner
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆158Updated last year
TalSchuster / CrossLingualContextualEmb
Cross-Lingual Alignment of Contextual Word Embeddings
☆99Updated 5 years ago
nert-nlp / streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
☆66Updated 2 months ago
HSLCY / GlossBERT
GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)
☆95Updated 2 years ago
conll / reference-coreference-scorers
This is the reference implementation of commonly used coreference metrics.
☆74Updated 7 years ago
WING-NUS / scisumm-corpus
Scientific Document Summarization Corpus and Annotations from the WING NUS group.
☆214Updated 2 years ago
m-popovic / chrF
a tool for calcualting character n-gram F score
☆74Updated 2 years ago
google-research-datasets / wiki-split
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
☆123Updated 6 years ago
danlou / LMMS
Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings
☆95Updated 2 years ago
neulab / compare-mt
A tool for holistic analysis of language generations systems
☆471Updated 3 years ago
explosion / tokenizations
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
☆192Updated last year
machamp-nlp / machamp
Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/
☆87Updated 2 months ago
facebookresearch / access
Code to reproduce the experiments from the paper.
☆101Updated last year