Helsinki-NLP / Tatoeba-ChallengeLinks
☆834Updated 10 months ago
Alternatives and similar repositories for Tatoeba-Challenge
Users that are interested in Tatoeba-Challenge are comparing it to the libraries listed below
Sorting:
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆756Updated 8 months ago
- Open neural machine translation models and web services☆701Updated last week
- A neural word aligner based on multilingual BERT☆350Updated 3 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆363Updated last year
- Training open neural machine translation models☆367Updated 3 months ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages☆1,228Updated last year
- New dataset☆304Updated 3 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆579Updated 2 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆223Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆155Updated last month
- Tools and Modeling Code for the MASSIVE dataset☆544Updated 2 years ago
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors☆509Updated 5 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated last year
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated 9 months ago
- NeuSpell: A Neural Spelling Correction Toolkit☆695Updated last year
- High-accuracy NLP parser with models for 11 languages.☆890Updated 3 years ago
- xfspell — the Transformer Spell Checker☆190Updated 5 years ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg…☆927Updated last year
- A sentence segmenter that actually works!☆307Updated 4 years ago
- ☆1,279Updated 2 years ago
- ☆1,575Updated 2 years ago
- Language-Agnostic SEntence Representations☆3,646Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆785Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark☆739Updated last year
- Open-Source Machine Translation Quality Estimation in PyTorch☆231Updated 3 years ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020☆601Updated 5 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆358Updated 2 years ago
- A tool for holistic analysis of language generations systems☆468Updated 3 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆342Updated 2 years ago
- Fast BPE☆671Updated last year