Unbabel/BConTrasT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Unbabel/BConTrasT)

Unbabel / BConTrasT

☆20

Alternatives and similar repositories for BConTrasT

Users that are interested in BConTrasT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

XL2248 / CPCC
View on GitHub
Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"
☆12Dec 17, 2021Updated 4 years ago
bicici / FDA
View on GitHub
Feature Decay Algorithms
☆11Mar 5, 2014Updated 12 years ago
XL2248 / MSCTD
View on GitHub
Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset"
☆44Dec 25, 2024Updated last year
akikoe / nmtrnng
View on GitHub
C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"
☆21May 8, 2017Updated 9 years ago
salesforce / localization-xml-mt
View on GitHub
A High-Quality Multilingual Dataset for Structured Documentation Translation
☆39May 1, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
THUNLP-MT / Document-Transformer
View on GitHub
Improving the Transformer translation model with document-level context
☆169Jul 7, 2020Updated 6 years ago
sameenmaruf / selective-attn
View on GitHub
Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation"
☆30Apr 12, 2020Updated 6 years ago
fandongmeng / DTMT_InDec
View on GitHub
Implementation of DTMT with incremental decoding
☆13Feb 20, 2021Updated 5 years ago
e9t / dotfiles
View on GitHub
My dotfiles.
☆21Mar 29, 2026Updated 3 months ago
MaxyLee / 3AM
View on GitHub
Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"
☆12Dec 8, 2024Updated last year
XMUDeepLIT / mc_tit
View on GitHub
Code for ACL 2023 paper: Exploring Better Text Image Translation with Multimodal Codebook
☆21Apr 19, 2026Updated 3 months ago
rgwt123 / simple-fairseq
View on GitHub
simple translate
☆12Mar 7, 2020Updated 6 years ago
lilt / alignment-scripts
View on GitHub
Scripts to preprocess training and test data and to run fast_align and giza
☆107Nov 2, 2021Updated 4 years ago
tmu-nlp / sscorpus
View on GitHub
A monolingual parallel corpus for sentence simplification
☆11Jul 4, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
longyuewangdcu / tvsub
View on GitHub
TVsub: DCU-Tencent Chinese-English Dialogue Corpus
☆47Feb 14, 2018Updated 8 years ago
rbawden / DiaBLa-dataset
View on GitHub
English-French MT dialogue dataset
☆17Apr 29, 2022Updated 4 years ago
yhy1117 / DA4NMT
View on GitHub
NMT domain adaptation papers (updating...)
☆17Jun 1, 2019Updated 7 years ago
sharejing / BiPaR
View on GitHub
This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu…
☆23Jul 13, 2021Updated 5 years ago
qhungngo / EVBCorpus
View on GitHub
The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.
☆52Jul 12, 2019Updated 7 years ago
lauhaide / clads
View on GitHub
XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…
☆10Nov 4, 2022Updated 3 years ago
NoSyu / VHUCM
View on GitHub
Implementation of Variational Hierarchical User-based Conversation Model
☆10Jul 2, 2021Updated 5 years ago
XL2248 / SOV-MAS
View on GitHub
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 3 years ago
samueljamesbell / sequence-labeler
View on GitHub
Neural network sequence labeling model
☆11Dec 28, 2019Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
wlin12 / SMMTT
View on GitHub
Social Media Machine Translation Toolkit
☆21Sep 13, 2013Updated 12 years ago
lena-voita / good-translation-wrong-in-context
View on GitHub
This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …
☆101May 12, 2020Updated 6 years ago
amittai / cynical
View on GitHub
Cynical data selection
☆20Jan 16, 2021Updated 5 years ago
tsuruoka-lab / AMI-Meeting-Parallel-Corpus
View on GitHub
AMI Meeting Parallel Corpus
☆13Dec 11, 2020Updated 5 years ago
jackbandy / bookcorpus-datasheet
View on GitHub
Documentation effort for the BookCorpus dataset
☆34Jun 2, 2021Updated 5 years ago
NiuTrans / LanguageCodes
View on GitHub
We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).
☆87Jun 2, 2021Updated 5 years ago
zhusleep / TransForFun
View on GitHub
使用谷歌翻译进行大规模翻译，免疫封锁
☆10Aug 1, 2019Updated 6 years ago
naver / covid19-nmt
View on GitHub
Multi-lingual & multi-domain (specialisation for biomedical data) translation model
☆40Nov 17, 2020Updated 5 years ago
Helsinki-NLP / OpusFilter
View on GitHub
OpusFilter - Parallel corpus processing toolkit
☆115Jul 1, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
korokes / MCLS
View on GitHub
Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos
☆10Sep 2, 2024Updated last year
neulab / word-embeddings-for-nmt
View on GitHub
Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018
☆123Sep 22, 2025Updated 10 months ago
cunliangkong / linux-envs
View on GitHub
personal settings for linux tools, including zsh, vim, tmux, pip.
☆11Dec 2, 2019Updated 6 years ago
ruili33 / SEC
View on GitHub
Source code for paper Are Human-generated Demonstrations Necessary for In-context Learning
☆12Jan 21, 2024Updated 2 years ago
HHW-zhou / TSMMG
View on GitHub
Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"
☆13Jul 8, 2025Updated last year
renj / pseudo-ref
View on GitHub
Implementation of the pseudo-reference generation algorithm proposed in EMNLP 2018 paper: Multi-Reference Training with Pseudo-References…
☆11Oct 15, 2018Updated 7 years ago
JHart96 / keras_gcn_sequence_labelling
View on GitHub
Keras implementation of graph convolutional networks for sequence labelling
☆12Sep 21, 2018Updated 7 years ago