igul222 / plaid
☆90 · Updated last year
Alternatives and similar repositories for plaid:
Users who are interested in plaid are comparing it to the libraries listed below:
- Reparameterized Discrete Diffusion Models for Text Generation ☆96 · Updated 2 years ago
- [ICLR 2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models ☆113 · Updated last month
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning" ☆40 · Updated last month
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 · Updated last year
- Stick-breaking attention ☆48 · Updated this week
- Code for the paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆70 · Updated last year
- ☆81 · Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control ☆67 · Updated 2 years ago
- ☆127 · Updated last year
- ☆51 · Updated 9 months ago
- Official PyTorch implementation for the ICLR 2025 paper "Scaling up Masked Diffusion Models on Text" ☆125 · Updated 2 months ago
- Language Quantized AutoEncoders ☆101 · Updated 2 years ago
- Automatically collects diffusion NLP papers from arXiv. More information on the papers can be found in another repository, "Diffusion-LM-Papers". ☆98 · Updated this week
- Language models scale reliably with over-training and on downstream tasks ☆96 · Updated 11 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" ☆33 · Updated 3 weeks ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022) ☆54 · Updated 2 years ago
- Official JAX implementation of MD4 Masked Diffusion Models ☆64 · Updated 2 weeks ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se… ☆64 · Updated 10 months ago
- Sparse Backpropagation for Mixture-of-Expert Training ☆28 · Updated 8 months ago
- Implementation of Self-conditioned Embedding Diffusion for Text Generation ☆36 · Updated 2 years ago
- ☆28 · Updated 4 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024) ☆42 · Updated 4 months ago
- A fusion of a linear layer and a cross-entropy loss, written for PyTorch in Triton. ☆63 · Updated 7 months ago
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024) ☆63 · Updated 7 months ago
- Educational implementation of the Discrete Flow Matching paper ☆78 · Updated 6 months ago
- Randomized Positional Encodings Boost Length Generalization of Transformers ☆79 · Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆55 · Updated 10 months ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024) ☆53 · Updated 10 months ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in PyTorch ☆99 · Updated 2 years ago