☆29Jun 10, 2024Updated last year
Alternatives and similar repositories for discourse-mt-test-sets
Users that are interested in discourse-mt-test-sets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆100May 12, 2020Updated 6 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆18Jan 18, 2021Updated 5 years ago
- Word sense disambiguation test sets for NMT☆20Dec 3, 2020Updated 5 years ago
- Contrastive evaluation of pronoun translation in neural machine translation☆26Aug 22, 2019Updated 6 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Feb 14, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- English-French MT dialogue dataset☆17Apr 29, 2022Updated 4 years ago
- bin files☆13Jan 30, 2025Updated last year
- ☆15Jun 17, 2019Updated 6 years ago
- Explicit Sentence Compression for Neural Machine Translation☆10May 12, 2020Updated 6 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23May 26, 2021Updated 5 years ago
- [CHABCNet] ABCNet on the Chinese dataset, building on Detectron2 (Facebook AI Research)☆11Oct 3, 2023Updated 2 years ago
- Post-editing Datasets by Rakuten (PEDRa)☆14Jun 23, 2021Updated 4 years ago
- ☆21Feb 13, 2023Updated 3 years ago
- Terminology Dataset☆24Feb 27, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Find informative examples to efficiently (human)-evaluate NLG models.☆17Apr 22, 2026Updated last month
- ☆33Oct 1, 2021Updated 4 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆130Apr 23, 2026Updated last month
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …☆15Aug 31, 2021Updated 4 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆164Apr 13, 2026Updated last month
- OpusFilter - Parallel corpus processing toolkit☆115May 13, 2026Updated 2 weeks ago
- Efficient Low-Memory Aligner☆147Jan 15, 2025Updated last year
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Apr 1, 2024Updated 2 years ago
- The implementation of "Does Multi-Encoder Help? A Case Study on Context-AwareNeural Machine Translation"☆39Aug 26, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A repository with the code related to experiments around context-aware machine translation☆51Sep 22, 2025Updated 8 months ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- simple translate☆12Mar 7, 2020Updated 6 years ago
- Cross Sentence Neural Machine Translation☆11Mar 26, 2018Updated 8 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Aug 31, 2021Updated 4 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆28Sep 14, 2024Updated last year
- ☆24Apr 2, 2024Updated 2 years ago
- ☆17Jul 5, 2022Updated 3 years ago
- ☆12Jan 30, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- NLP Preprocessing Pipeline Wrappers☆11May 12, 2023Updated 3 years ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆10Apr 14, 2025Updated last year
- Reading list for research topics in Diffusion models.☆18Jan 12, 2024Updated 2 years ago
- [SynthText Chinese] Improved code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural I…☆13Dec 8, 2022Updated 3 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆42Dec 19, 2023Updated 2 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu…☆23Jul 13, 2021Updated 4 years ago