☆20 · Aug 17, 2021 · Updated 4 years ago
Alternatives and similar repositories for BConTrasT
Users that are interested in BConTrasT are comparing it to the libraries listed below
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation" ☆12 · Dec 17, 2021 · Updated 4 years ago
- Feature Decay Algorithms ☆11 · Mar 5, 2014 · Updated 11 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation ☆37 · May 1, 2025 · Updated 10 months ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset" ☆12 · Dec 8, 2024 · Updated last year
- simple translate ☆12 · Mar 7, 2020 · Updated 5 years ago
- Implementation of DTMT with incremental decoding ☆13 · Feb 20, 2021 · Updated 5 years ago
- Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset" ☆45 · Dec 25, 2024 · Updated last year
- Scripts to preprocess training and test data and to run fast_align and giza ☆107 · Nov 2, 2021 · Updated 4 years ago
- bilingual dictionary extractor from parallel corpora ☆23 · Jul 3, 2014 · Updated 11 years ago
- Cynical data selection ☆20 · Jan 16, 2021 · Updated 5 years ago
- Source code for the AAAI 2020 long paper <Modeling Fluency and Faithfulness for Diverse Neural Machine Translation>. ☆19 · Mar 10, 2020 · Updated 5 years ago
- Social Media Machine Translation Toolkit ☆21 · Sep 13, 2013 · Updated 12 years ago
- Improving the Transformer translation model with document-level context ☆170 · Jul 7, 2020 · Updated 5 years ago
- NMT domain adaptation papers (updating...) ☆17 · Jun 1, 2019 · Updated 6 years ago
- MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation o… ☆22 · Oct 29, 2017 · Updated 8 years ago
- ☆21 · May 30, 2022 · Updated 3 years ago
- Japanese--Russian--English News Commentary Parallel Data ☆18 · Jul 9, 2019 · Updated 6 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts. ☆49 · Jul 12, 2019 · Updated 6 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus ☆46 · Feb 14, 2018 · Updated 8 years ago
- This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu… ☆23 · Jul 13, 2021 · Updated 4 years ago
- Resources for the OpenNMT hackathon ☆51 · May 24, 2019 · Updated 6 years ago
- Import of https://sourceforge.net/projects/champollion ☆18 · Mar 14, 2016 · Updated 9 years ago
- Code for ACL 2023 paper: Exploring Better Text Image Translation with Multimodal Codebook ☆21 · May 12, 2025 · Updated 9 months ago
- We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls). ☆88 · Jun 2, 2021 · Updated 4 years ago
- A repository containing the code for speech translation papers. ☆21 · Mar 11, 2022 · Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit ☆115 · Feb 11, 2026 · Updated 2 weeks ago
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018 ☆123 · Sep 22, 2025 · Updated 5 months ago
- Neural Machine Translation in Pytorch ☆31 · Jun 11, 2018 · Updated 7 years ago
- Documentation effort for the BookCorpus dataset ☆34 · Jun 2, 2021 · Updated 4 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks ☆67 · May 9, 2022 · Updated 3 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia ☆74 · Feb 25, 2015 · Updated 11 years ago
- TER-plus Machine Translation metric. ☆31 · May 23, 2022 · Updated 3 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation ☆14 · Jul 30, 2025 · Updated 7 months ago
- ☆86 · Dec 26, 2022 · Updated 3 years ago
- Simultaneous NMT/MMT framework in PyTorch ☆38 · Mar 22, 2025 · Updated 11 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization" ☆11 · May 16, 2023 · Updated 2 years ago
- personal settings for linux tools, including zsh, vim, tmux, pip. ☆11 · Dec 2, 2019 · Updated 6 years ago
- WordNet Domains, WordNet Affect and SentiWords ☆48 · Jan 8, 2016 · Updated 10 years ago
- A Python script to delete all comment and submission data from a given Reddit account. ☆11 · Jan 5, 2021 · Updated 5 years ago