rbawden / DiaBLa-dataset
English-French MT dialogue dataset
☆17 Updated 3 years ago
Alternatives and similar repositories for DiaBLa-dataset
Users who are interested in DiaBLa-dataset are comparing it to the repositories listed below
- ☆28 Updated 11 months ago
- ☆16 Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 3 years ago
- Feature Decay Algorithms ☆11 Updated 11 years ago
- ☆24 Updated 2 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal… ☆81 Updated 3 years ago
- Pretraining scripts for BART transformer model ☆11 Updated 2 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization" ☆25 Updated 3 years ago
- ☆36 Updated 2 years ago
- ☆22 Updated 4 years ago
- Webpage for the DSTC8 - NOESIS II: Predicting Responses ☆48 Updated 2 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆39 Updated 4 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation ☆17 Updated 4 years ago
- Source code for "Improving Robustness of Neural Machine Translation with Multi-task Learning" ☆19 Updated 5 years ago
- Python code for training models in the ACL paper "Beyond BLEU: Training Neural Machine Translation with Semantic Similarity" ☆52 Updated 5 years ago
- ☆20 Updated 4 years ago
- ☆21 Updated 2 years ago
- Code for the paper "Cross-Lingual BERT Transformation for Zero-Shot Dependency Parsing" ☆35 Updated 5 years ago
- GMEG ☆29 Updated 5 months ago
- NMT domain adaptation papers (updating...) ☆17 Updated 5 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance" ☆25 Updated 2 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020" ☆23 Updated 3 years ago
- ☆41 Updated 4 years ago
- ☆28 Updated 5 years ago
- This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu… ☆23 Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning ☆72 Updated last year
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval ☆43 Updated last year
- Terminology Dataset ☆23 Updated 5 years ago
- ☆92 Updated last year
- Scripts to preprocess training and test data and to run fast_align and giza ☆108 Updated 3 years ago