kakaobrain / word2word
Easy-to-use word-to-word translations for 3,564 language pairs.
☆354Updated 3 years ago
Related projects: ⓘ
- Evaluating Cross-lingual Sentence Representations☆440Updated 3 years ago
- A tool for holistic analysis of language generations systems☆465Updated 2 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆432Updated 5 months ago
- Deep neural models for core NLP tasks (Pytorch version)☆440Updated 2 years ago
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆240Updated 8 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆226Updated 3 years ago
- ☆606Updated last month
- Simple, fast unsupervised word aligner☆732Updated 2 years ago
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors☆482Updated 4 years ago
- Ner with Bert☆281Updated 4 years ago
- A sentence segmenter that actually works!☆303Updated 4 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆220Updated last year
- Unsupervised Statistical Machine Translation☆227Updated 4 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0☆199Updated last year
- A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunk…☆234Updated 5 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 2 years ago
- Explains nlp building blocks in a simple manner.☆251Updated 4 years ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆431Updated last year
- Unsupervised Neural Machine Translation☆472Updated 4 years ago
- Bitextor generates translation memories from multilingual websites☆287Updated 3 months ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆198Updated 4 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆545Updated 2 years ago
- A framework to learn cross-lingual word embedding mappings☆642Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆347Updated 10 months ago
- ☆317Updated last year
- Easy to use NLP library built on PyTorch and TorchText☆253Updated 4 years ago
- 📃Language Model based sentences scoring library☆300Updated 2 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆724Updated last month
- Unsupervised Question answering via Cloze Translation☆219Updated 2 years ago