kakaobrain / word2wordLinks
Easy-to-use word-to-word translations for 3,564 language pairs.
☆366Updated 4 years ago
Alternatives and similar repositories for word2word
Users that are interested in word2word are comparing it to the libraries listed below
Sorting:
- A sentence segmenter that actually works!☆306Updated 4 years ago
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆249Updated 9 years ago
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors☆510Updated 5 years ago
- Preprocessing Library for Natural Language Processing☆164Updated 2 years ago
- A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunk…☆234Updated 6 years ago
- Evaluating Cross-lingual Sentence Representations☆457Updated 3 years ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆197Updated 5 years ago
- Explains nlp building blocks in a simple manner.☆251Updated 5 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆450Updated last year
- Unsupervised Statistical Machine Translation☆229Updated 4 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆560Updated 3 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆229Updated 4 years ago
- A tool for holistic analysis of language generations systems☆471Updated 3 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.☆48Updated 6 years ago
- ☆323Updated 2 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆311Updated 4 years ago
- Method to encode text for GPT-2 to generate text based on provided keywords☆260Updated 4 years ago
- Bitextor generates translation memories from multilingual websites☆294Updated 9 months ago
- Transformer language model (GPT-2) with sentencepiece tokenizer☆164Updated 4 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆223Updated 2 years ago
- Python code for various NLP metrics☆168Updated 5 years ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆232Updated 3 years ago
- Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task☆92Updated 5 years ago
- Neural Essay Assessor: An Automated Essay Scoring System Based on Deep Neural Networks☆209Updated 7 years ago
- Corpora for evaluating NLU services (like API.ai, RASA, Microsoft LUIS, ...)☆146Updated 5 years ago
- Sentence paraphrase generation at the sentence level☆408Updated 2 years ago
- It is a question-generator model. It takes text and an answer as input and outputs a question.☆170Updated 6 years ago
- (yet another not really) awesome topic/text segmentation list☆109Updated 6 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- LASER multilingual sentence embeddings as a pip package☆224Updated 2 years ago