madaan / minimal-text-diffusion
A minimal implementation of diffusion models for text generation
☆327 Updated last year
Alternatives and similar repositories for minimal-text-diffusion:
Users who are interested in minimal-text-diffusion compare it with the libraries listed below.
- Implementation of Self-conditioned Embedding Diffusion for Text Generation ☆36 Updated 2 years ago
- [ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834) ☆475 Updated 11 months ago
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models ☆751 Updated 11 months ago
- Simplified Masked Diffusion Language Model ☆273 Updated 2 months ago
- ☆122 Updated 11 months ago
- Reparameterized Discrete Diffusion Models for Text Generation ☆94 Updated 2 years ago
- Diffusion-LM ☆1,089 Updated 6 months ago
- Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT ☆207 Updated 6 months ago
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models ☆301 Updated last year
- Implementation of Bit Diffusion, Hinton's group's attempt at discrete denoising diffusion, in PyTorch ☆341 Updated last year
- A simple implementation of Bayesian Flow Networks (BFN) ☆240 Updated last year
- ☆86 Updated last year
- A curated list of awesome discrete diffusion model resources. ☆231 Updated 2 weeks ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch ☆225 Updated 5 months ago
- ☆171 Updated last year
- Code for the ALiBi method for transformer language models (ICLR 2022) ☆515 Updated last year
- Sequence modeling with Mega. ☆298 Updated 2 years ago
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control ☆66 Updated 2 years ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024) ☆54 Updated 9 months ago
- Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch ☆306 Updated 8 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022) ☆54 Updated last year
- Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in PyTorch ☆405 Updated last month
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind ☆173 Updated 5 months ago
- Minimal implementation of a D3PM in PyTorch ☆196 Updated 9 months ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024] ☆94 Updated last year
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate … ☆630 Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆204 Updated last year
- Unofficial implementation of Consistency Models in PyTorch ☆255 Updated last year
- Scaling Data-Constrained Language Models ☆333 Updated 5 months ago
- Repository for code used in the xVal paper ☆128 Updated 10 months ago