Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"
☆60Oct 29, 2025Updated 8 months ago
Alternatives and similar repositories for SPG
Users that are interested in SPG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆45Oct 29, 2025Updated 8 months ago
- A lightweight Inference Engine built for block diffusion models☆47Apr 12, 2026Updated 2 months ago
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆45Jan 28, 2026Updated 5 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆450Jan 26, 2026Updated 5 months ago
- Crawl & Visualize NeurIPS 2022 Data from OpenReview☆14Nov 8, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Jun 15, 2023Updated 3 years ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆510Jan 28, 2026Updated 5 months ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆20Jul 27, 2025Updated 11 months ago
- ☆25Mar 4, 2024Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Generates text with diffusion models. Reproduction of the Continous Diffusion for Categorical Data paper by Deepmind☆18Dec 9, 2024Updated last year
- ☆13Apr 19, 2025Updated last year
- Official Implementation of wd1☆31Sep 25, 2025Updated 9 months ago
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆79Feb 9, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is the open-source code for TokenCarve.☆25Jan 23, 2026Updated 5 months ago
- ☆28Dec 19, 2025Updated 6 months ago
- ☆18Apr 19, 2024Updated 2 years ago
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆359Jun 2, 2026Updated last month
- [AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?☆30Dec 14, 2025Updated 6 months ago
- Pytorch implementation of gradCAM, guidedBackProp, smoothGrad☆13Mar 5, 2019Updated 7 years ago
- ☆12Dec 18, 2024Updated last year
- Out-of-distribution generalization benchmarks for image recognition models☆14Apr 5, 2020Updated 6 years ago
- The Polaris datasets and benchmarks recipes☆14May 26, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Dec 28, 2022Updated 3 years ago
- About Code release for "FlashBias: Fast Computation of Attention with Bias" (NeurIPS 2025), https://arxiv.org/abs/2505.12044☆29Nov 17, 2025Updated 7 months ago
- ☆55Apr 14, 2026Updated 2 months ago
- [AAAI 2026 Oral] SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation☆71Apr 5, 2026Updated 2 months ago
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆25Jun 17, 2025Updated last year
- Developing efficient Bayesian phylogenetic inference methods☆12Dec 21, 2019Updated 6 years ago
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆28Nov 9, 2024Updated last year
- ☆20Oct 9, 2024Updated last year
- ☆31Aug 18, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆18Feb 26, 2026Updated 4 months ago
- 告诉你每门课的意义所在。☆22May 16, 2014Updated 12 years ago
- ☆13May 1, 2024Updated 2 years ago
- Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking"☆19Mar 10, 2025Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- R3: Robust Rubric-Agnostic Reward Models☆23Jul 12, 2025Updated 11 months ago
- ☆11Nov 3, 2023Updated 2 years ago