kakaobrain / word2wordLinks
Easy-to-use word-to-word translations for 3,564 language pairs.
☆366Updated 4 years ago
Alternatives and similar repositories for word2word
Users that are interested in word2word are comparing it to the libraries listed below
Sorting:
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆250Updated 9 years ago
- Preprocessing Library for Natural Language Processing☆164Updated 2 years ago
- A sentence segmenter that actually works!☆305Updated 5 years ago
- Evaluating Cross-lingual Sentence Representations☆457Updated 4 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆560Updated 3 years ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆197Updated 5 years ago
- ☆323Updated 2 years ago
- Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task☆92Updated 5 years ago
- Unsupervised Statistical Machine Translation☆229Updated 5 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆230Updated 4 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆451Updated last year
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors☆510Updated 5 years ago
- Explains nlp building blocks in a simple manner.☆251Updated 5 years ago
- A tool for holistic analysis of language generations systems☆471Updated 3 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.☆48Updated 6 years ago
- ☆606Updated 2 months ago
- A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunk…☆234Updated 6 years ago
- Automatic question generation by using NLP☆206Updated last year
- Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data☆251Updated 5 years ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆151Updated 3 years ago
- Transformer language model (GPT-2) with sentencepiece tokenizer☆164Updated 4 years ago
- Neural models and instructions on how to reproduce our results for our neural grammatical error correction systems from M. Junczys-Dowmun…☆88Updated 6 years ago
- Deep neural models for core NLP tasks (Pytorch version)☆441Updated 3 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated 2 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆312Updated 4 years ago
- Unsupervised Question answering via Cloze Translation☆219Updated 3 years ago
- Code and model files for the paper: "A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction" (AAAI-18…☆184Updated 6 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆230Updated 2 years ago
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.☆123Updated 6 years ago
- interactive explorer for language models☆135Updated 3 years ago