thu-coai / DA-TransformerView external linksLinks
Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation"
☆135Sep 10, 2023Updated 2 years ago
Alternatives and similar repositories for DA-Transformer
Users that are interested in DA-Transformer are comparing it to the libraries listed below
Sorting:
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆13Mar 1, 2023Updated 2 years ago
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".☆12Jan 4, 2024Updated 2 years ago
- ☆187Jul 22, 2024Updated last year
- Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation☆44Jan 9, 2024Updated 2 years ago
- Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- Implementation of latent-GLAT (ACL-2022)☆34Apr 30, 2022Updated 3 years ago
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"☆24Dec 11, 2023Updated 2 years ago
- Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"☆18Mar 11, 2022Updated 3 years ago
- ☆16Jul 11, 2023Updated 2 years ago
- Non-autoregressive Translation by Learning Target Categorical Codes☆11Jul 11, 2021Updated 4 years ago
- Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"☆137Mar 15, 2023Updated 2 years ago
- ☆10Feb 12, 2020Updated 6 years ago
- ☆18Mar 10, 2023Updated 2 years ago
- ☆13Aug 23, 2024Updated last year
- ☆15Dec 5, 2019Updated 6 years ago
- source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- ☆28Sep 28, 2021Updated 4 years ago
- codes for "Scheduled Sampling Based on Decoding Steps for Neural Machine Translation" (long paper of EMNLP-2022)☆20Aug 31, 2021Updated 4 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆303Mar 15, 2023Updated 2 years ago
- lanmt ebm☆12Jun 19, 2020Updated 5 years ago
- ☆12Dec 13, 2022Updated 3 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆12Mar 7, 2024Updated last year
- Dependency Parsing as Sequence Labeling with Python3+ and PyTorch1+ and MTL☆10Nov 21, 2019Updated 6 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆61Oct 10, 2022Updated 3 years ago
- Implementation of QKVAE☆11Feb 24, 2023Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Oct 27, 2022Updated 3 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".☆14Apr 25, 2023Updated 2 years ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- Reparameterized Discrete Diffusion Models for Text Generation☆104Feb 14, 2023Updated 3 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆25Oct 2, 2020Updated 5 years ago
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.☆24Jan 17, 2022Updated 4 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆23Jun 14, 2020Updated 5 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆87Mar 7, 2023Updated 2 years ago
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Nov 7, 2022Updated 3 years ago