rbawden/DiaBLa-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rbawden/DiaBLa-dataset)

rbawden / DiaBLa-dataset

English-French MT dialogue dataset

☆17

Alternatives and similar repositories for DiaBLa-dataset

Users that are interested in DiaBLa-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rbawden / discourse-mt-test-sets
View on GitHub
☆29Jun 10, 2024Updated 2 years ago
Eurus-Holmes / CHABCNet
View on GitHub
[CHABCNet] ABCNet on the Chinese dataset, building on Detectron2 (Facebook AI Research)
☆12Oct 3, 2023Updated 2 years ago
isl-mt / fluent-fisher
View on GitHub
☆15Jun 17, 2019Updated 7 years ago
shuyanzhou / multitask_transformer
View on GitHub
Source code for "Improving Robustness of Neural Machine Translation with Multi-task Learning"
☆19Aug 15, 2019Updated 6 years ago
ImperialNLP / MMT-Delib
View on GitHub
☆10Dec 21, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
rgwt123 / simple-fairseq
View on GitHub
simple translate
☆12Mar 7, 2020Updated 6 years ago
marzenakrp / LiteraryTranslation
View on GitHub
☆24Apr 2, 2024Updated 2 years ago
isl-mt / SLT.KIT
View on GitHub
Spoken Language Translation System
☆20Jul 26, 2021Updated 5 years ago
Eurus-Holmes / SynthText_CH
View on GitHub
[SynthText Chinese] Improved code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural I…
☆14Dec 8, 2022Updated 3 years ago
ucasligang / awesome-Diffusion
View on GitHub
Reading list for research topics in Diffusion models.
☆18Jan 12, 2024Updated 2 years ago
joshua-decoder / thrax
View on GitHub
Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation
☆15Dec 2, 2016Updated 9 years ago
sharejing / BiPaR
View on GitHub
This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu…
☆23Jul 13, 2021Updated 5 years ago
david-abel / aaai_2019
View on GitHub
Conference notes for AAAI 2019
☆15Feb 1, 2019Updated 7 years ago
Unbabel / BConTrasT
View on GitHub
☆20Aug 17, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
google-research-datasets / lareqa
View on GitHub
LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…
☆14May 19, 2020Updated 6 years ago
mjpost / bin
View on GitHub
bin files
☆13Jan 30, 2025Updated last year
tnq177 / witwicky
View on GitHub
Witwicky: An implementation of Transformer in PyTorch.
☆22Aug 17, 2020Updated 5 years ago
mhardalov / exams-qa
View on GitHub
A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
☆49Apr 5, 2022Updated 4 years ago
nitikam / tangled
View on GitHub
Code, data, and additional analysis for the paper Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evalua…
☆15Aug 13, 2020Updated 5 years ago
NingMiao / KerBS
View on GitHub
Codes for <Kernelized Bayesian Softmax for Text Generation> in NeurIPS 2019
☆16Nov 20, 2019Updated 6 years ago
mayhewsw / pytorch-truecaser
View on GitHub
A simple neural truecaser written in pytorch and allennlp.
☆35Jun 17, 2024Updated 2 years ago
kevinduh / sockeye-recipes
View on GitHub
Training scripts and recipes for Sockeye Neural Machine Translation toolkit
☆37Sep 8, 2019Updated 6 years ago
shuoyangd / tape4nmt
View on GitHub
a ducttape workflow for neural machine translation
☆14Mar 23, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
JDonner / TreeKernel
View on GitHub
C++ implementation of Alessandro Moschitti's Tree Kernel algorithm, from "Making Tree Kernels Practical for Natural Language Learning"
☆12Oct 10, 2019Updated 6 years ago
elipousson / mapbaltimore
View on GitHub
🗺🏙 Map Baltimore neighborhoods and local open data
☆19Oct 10, 2025Updated 9 months ago
joshua-decoder / joshua_translation_engine
View on GitHub
RESTful wrapper for the Joshua machine translation decoder
☆14Oct 25, 2016Updated 9 years ago
jimth001 / Bi-Seq2Seq
View on GitHub
An implementation of "Two are Better than One: An Ensemble of Retrieval- and Generation-Based Dialog Systems"
☆14Jul 23, 2019Updated 7 years ago
openredact / nerwhal
View on GitHub
This is a prototype of a multi-lingual suite for named-entity recognition in Python. ➡️ The project has moved to: https://gitlab.opencode…
☆21Mar 20, 2026Updated 4 months ago
openfeedback / superhf
View on GitHub
Open-source Human Feedback Library
☆11Oct 25, 2023Updated 2 years ago
dhfbk / KIND
View on GitHub
KIND: an Italian Multi-Domain Dataset for Named Entity Recognition
☆13Jun 28, 2023Updated 3 years ago
jcsilva / multilingual-g2p
View on GitHub
Multilingual Grapheme to Phoneme
☆50Feb 23, 2016Updated 10 years ago
Genius1237 / TyDiP
View on GitHub
TyDiP Multilingual Politeness dataset and code
☆12Oct 15, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
neulab / word-embeddings-for-nmt
View on GitHub
Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018
☆123Sep 22, 2025Updated 10 months ago
danliu2 / caat
View on GitHub
☆35Sep 1, 2022Updated 3 years ago
dqqcasia / st
View on GitHub
End-to-end Speech Translation
☆35Apr 12, 2021Updated 5 years ago
ImperialNLP / pysimt
View on GitHub
Simultaneous NMT/MMT framework in PyTorch
☆38Mar 22, 2025Updated last year
matthewmorrone / cmudict-ipa
View on GitHub
CMU dictionary in IPA instead of their subset of Arpabet
☆16Jun 21, 2026Updated last month
ZurichNLP / ContraWSD
View on GitHub
Word sense disambiguation test sets for NMT
☆21Dec 3, 2020Updated 5 years ago
kjhealy / england_gdp_long
View on GitHub
☆16Jan 8, 2023Updated 3 years ago