Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!
☆35Jun 23, 2025Updated 9 months ago
Alternatives and similar repositories for AO-GPT-MDM
Users that are interested in AO-GPT-MDM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆82May 30, 2025Updated 10 months ago
- Grammar test suite for masked language models☆10Jan 1, 2023Updated 3 years ago
- ☆55Updated this week
- ☆39Aug 28, 2025Updated 7 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆387May 31, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆42Jul 18, 2025Updated 9 months ago
- Code for our paper "Unlocking Guidance for Discrete State-Space Diffusion and Flow Models"☆34Apr 18, 2025Updated last year
- [NeurIPS 2024] Simple and Effective Masked Diffusion Language Model☆675Sep 29, 2025Updated 6 months ago
- The PackNet Continual Learning Method in Pytorch☆15Aug 19, 2021Updated 4 years ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆164Feb 16, 2026Updated 2 months ago
- ☆44Sep 15, 2025Updated 7 months ago
- Esoteric Language Models☆116Mar 27, 2026Updated 3 weeks ago
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)☆14Sep 28, 2025Updated 6 months ago
- Shadow Attack, LiRA, Quantile Regression and RMIA implementations in PyTorch (Online version)☆14Nov 8, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆377Dec 22, 2024Updated last year
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 5 months ago
- [NeurIPS 2025] Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation☆35Oct 24, 2025Updated 5 months ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆87Jan 9, 2025Updated last year
- Official repository for "Solving Video Inverse Problems Using Image Diffusion Models"☆11Mar 7, 2026Updated last month
- a clean blog☆10Apr 20, 2020Updated 5 years ago
- Code for "Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders" at ICML 2024☆10Sep 18, 2025Updated 7 months ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 9 months ago
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation☆68Jul 22, 2025Updated 8 months ago
- Does Diffusion Beat GAN in Image Super Resolution?☆12May 27, 2024Updated last year
- Official implementation for CVPR 2025 paper "AMO Sampler: Enhancing Text Rendering with Overshooting"☆30May 3, 2025Updated 11 months ago
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆13Aug 2, 2024Updated last year
- Code for the paper https://arxiv.org/abs/2402.04997☆107Feb 8, 2024Updated 2 years ago
- ☆18May 20, 2025Updated 10 months ago
- ☆17May 14, 2020Updated 5 years ago
- User handbook for mist-v2☆27Dec 16, 2023Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of various equivariant models in JAX☆19Apr 12, 2024Updated 2 years ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129May 22, 2025Updated 10 months ago
- YetAnotherWandbClient☆13Mar 16, 2026Updated last month
- [NeurIPS 2021] "Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks" by Yon…☆13Feb 13, 2022Updated 4 years ago
- PyTorch implementation of our ICLR 2023 paper titled "Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?".☆12Mar 13, 2023Updated 3 years ago
- Code to reproduce experiments in Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows☆14May 23, 2024Updated last year
- ☆14Jul 21, 2023Updated 2 years ago