allenai / tess-diffusionLinks
☆20Updated 2 years ago
Alternatives and similar repositories for tess-diffusion
Users that are interested in tess-diffusion are comparing it to the libraries listed below
Sorting:
- Reparameterized Discrete Diffusion Models for Text Generation☆104Updated 2 years ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025).☆58Updated 4 months ago
- ☆111Updated 2 years ago
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆76Updated 3 years ago
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"☆24Updated 3 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆99Updated 2 years ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆84Updated 9 months ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆55Updated last year
- ☆149Updated last year
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆55Updated 6 months ago
- A simple DIffusion LM approach.☆26Updated 2 years ago
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation☆60Updated 6 months ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- Diffusion Model Improvement Method☆34Updated 2 years ago
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆31Updated 8 months ago
- Implementation of Self-conditioned Embedding Diffusion for Text Generation☆38Updated 3 years ago
- Inference-Time Alignment in Protein Diffusion Models☆51Updated last month
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆88Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Updated 9 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆19Updated 7 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆77Updated 10 months ago
- Structured Chemistry Reasoning with Large Language Models☆39Updated last year
- ☆149Updated 6 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆70Updated last year
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated 2 years ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆46Updated 3 months ago
- ☆25Updated 6 months ago
- Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models☆78Updated 2 years ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆86Updated 11 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆66Updated this week