rbawden / DiaBLa-datasetLinks
English-French MT dialogue dataset
☆17Updated 3 years ago
Alternatives and similar repositories for DiaBLa-dataset
Users that are interested in DiaBLa-dataset are comparing it to the libraries listed below
Sorting:
- ☆28Updated last year
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆17Updated 4 years ago
- ☆94Updated last year
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Updated 3 years ago
- Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- ☆24Updated 2 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆108Updated 3 years ago
- NMT domain adaptation papers (updating...)☆17Updated 6 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆97Updated 5 years ago
- Feature Decay Algorithms☆11Updated 11 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 5 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources☆26Updated 6 years ago
- ☆22Updated 4 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆91Updated 6 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated 2 years ago
- ☆16Updated 4 years ago
- Source code for paper Grammatical Error Correction in Low-Resource Scenarios (W-NUT 2019)☆13Updated 3 years ago
- Source code for "Improving Robustness of Neural Machine Translation with Multi-task Learning"☆19Updated 5 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 4 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Updated 7 years ago
- ☆21Updated 3 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- ☆36Updated 2 years ago
- ☆20Updated 4 years ago
- Terminology Dataset☆23Updated 5 years ago
- ☆34Updated 4 years ago
- Domain Adaptation of Neural Machine Translation by Lexicon Induction☆20Updated 5 years ago
- Code for "Simulated Multiple Reference Training Improves Low-Resource Machine Translation"☆15Updated 4 years ago
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆42Updated 3 years ago