CharizardAcademy / convtransformerLinks

Code for the ACL2020 paper Character-Level Translation with Self-Attention

☆31

Alternatives and similar repositories for convtransformer

Users that are interested in convtransformer are comparing it to the libraries listed below

Sorting:

iedwardwangi / MetaAdapter
☆22Updated 4 years ago
shashwattrivedi / Attention_visualizer
A visualizer to display attention weights on text
☆23Updated 6 years ago
facebookresearch / DisCo
DisCo Transformer for Non-autoregressive MT
☆77Updated 3 years ago
berniebear / Multi-HT100M
☆53Updated 3 years ago
Yifan-Gao / open_retrieval_conversational_machine_reading
Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset
☆13Updated 2 years ago
lzy1732008 / GaussionTransformer
For paper《Gaussian Transformer: A Lightweight Approach for Natural Language Inference》
☆28Updated 5 years ago
ranqiu92 / RecoverSAT
☆18Updated last year
intersun / CoDIR
Code for EMNLP 2020 paper CoDIR
☆41Updated 2 years ago
ictnlp / TLAT-NMT
Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.
☆20Updated 2 years ago
yxuansu / NAG-BERT
[EACL'21] Non-Autoregressive with Pretrained Language Model
☆62Updated 2 years ago
bojone / univae
基于Transformer的单模型、多尺度的VAE模型
☆57Updated 4 years ago
haorannlp / mix
Code for "Mixed Cross Entropy Loss for Neural Machine Translation"
☆20Updated 4 years ago
zinengtang / VidLanKD
Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))
☆56Updated 2 years ago
cooelf / UVR-NMT
Neural Machine Translation with universal Visual Representation (ICLR 2020)
☆89Updated 5 years ago
xwgeng / SSAN
How Does Selective Mechanism Improve Self-attention Networks?
☆29Updated 4 years ago
lemmonation / jm-nat
Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"
☆39Updated 5 years ago
lioutasb / TaLKConvolutions
Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)
☆29Updated 4 years ago
eyalbd2 / PADA
Official code for the paper "PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains".
☆51Updated 3 years ago
leaderj1001 / Synthesizer-Rethinking-Self-Attention-Transformer-Models
Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch
☆70Updated 5 years ago
yxuansu / TaCL
[NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning
☆93Updated 3 years ago
microsoft / EA-VQ-VAE
This repo provides the code for the ACL 2020 paper "Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEnco…
☆55Updated 4 years ago
fallcat / stupidNMT
Hard-Coded Gaussian Attention for Neural Machine Translation
☆36Updated 2 years ago
lemmonation / abnet
Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"
☆32Updated 2 years ago
cookielee77 / DAST
Domain Adaptive Text Style Transfer, EMNLP 2019
☆70Updated 5 years ago
lifu-tu / ENGINE
ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation
☆25Updated 4 years ago
nuaa-nlp / Multimodality
☆15Updated 3 years ago
m3yrin / aligned-cross-entropy
Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655
☆21Updated last year
romebert / RomeBERT
☆16Updated 4 years ago
jingjingli01 / TGLS
TGLS: Unsupervised Text Generation by Learning from Search
☆25Updated 4 years ago
zlinao / Variational-Transformer
Variational Transformers for Diverse Response Generation
☆81Updated last year