qhungngo / EVBCorpusLinks
The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.
☆43Updated 6 years ago
Alternatives and similar repositories for EVBCorpus
Users that are interested in EVBCorpus are comparing it to the libraries listed below
Sorting:
- Neural Machine Translation system for English to Vietnamese (IWSLT'15 English-Vietnamese data)☆60Updated 6 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆108Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆85Updated last year
- OpusFilter - Parallel corpus processing toolkit☆109Updated this week
- TUFS Asian Language Parallel Corpus☆51Updated 2 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆156Updated 2 months ago
- cLang-8 is a dataset for grammatical error correction.☆107Updated 3 years ago
- Transformer based translation quality estimation☆113Updated 2 years ago
- ☆36Updated 2 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated last year
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- Yet Another Neural Machine Translation Toolkit☆179Updated 5 months ago
- Repository to collect and categorize Grammatical Error Correction papers.☆119Updated 4 months ago
- ☆94Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated 2 years ago
- ☆42Updated 7 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆56Updated last year
- Improved version of GECToR☆59Updated 2 years ago
- Improved Sentence Alignment in Linear Time and Space☆180Updated 2 years ago
- Simultaneous NMT/MMT framework in PyTorch☆38Updated 4 months ago
- ☆21Updated 3 years ago
- A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT☆27Updated 4 years ago
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018☆124Updated 5 years ago
- Zero -- A neural machine translation system☆153Updated 2 years ago
- Add noise to your text, can be used to improve synthetic training corpus for Neural Machine Translation☆41Updated 6 years ago
- Source code for paper Grammatical Error Correction in Low-Resource Scenarios (W-NUT 2019)☆13Updated 3 years ago
- ☆59Updated 2 years ago