kenantang / petci
PETCI: A Parallel English Translation Dataset of Chinese Idioms
☆24 Updated 3 years ago
Alternatives and similar repositories for petci:
Users who are interested in petci are comparing it to the repositories listed below.
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models ☆24 Updated 3 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021 ☆60 Updated 3 years ago
- ☆44 Updated 2 years ago
- A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling ☆23 Updated 3 years ago
- ☆36 Updated 2 years ago
- ☆31 Updated last year
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance" ☆25 Updated 2 years ago
- Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-… ☆34 Updated 3 months ago
- A dataset and baselines for CLS. ☆11 Updated 2 years ago
- ☆31 Updated 3 years ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021) ☆24 Updated 4 years ago
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation ☆21 Updated 3 years ago
- ☆20 Updated 4 years ago
- [ACL'21] Data for "An In-depth Study on Internal Structure of Chinese Words". ☆14 Updated 3 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆39 Updated 4 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET ☆62 Updated 3 years ago
- mSimCSE: Multilingual SimCSE ☆34 Updated 2 years ago
- Code base for "G-Transformer for Document-level Machine Translation" ☆45 Updated last year
- ☆25 Updated 4 years ago
- This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu… ☆23 Updated 3 years ago
- ☆37 Updated 3 years ago
- code and data for paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples" ☆24 Updated 3 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus ☆47 Updated 7 years ago
- CoSDA-ML: Multi-Lingual Code-Switching Data Augmentation for Zero-Shot Cross-Lingual NLP ☆51 Updated 2 years ago
- https://liuzeming01.github.io/XDailyDialog/ ☆11 Updated last year
- Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear project… ☆16 Updated 3 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits" ☆19 Updated 2 years ago
- Tools for formatting WMT hypothesis and test sets in XML ☆27 Updated 2 weeks ago
- Codes for our CCL 2021 paper: Incorporating Commonsense Knowledge into Abstractive Dialogue Summarization via Heterogeneous Graph Network… ☆25 Updated 3 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>. ☆20 Updated 2 years ago