Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".
☆134Apr 3, 2026Updated last month
Alternatives and similar repositories for JustGRPO
Users that are interested in JustGRPO are comparing it to the libraries listed below.
- ☆14Dec 19, 2024Updated last year
- [Nature Machine Intelligence 2025] Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception☆145Nov 25, 2025Updated 5 months ago
- ☆21Mar 5, 2025Updated last year
- [CVPR 2026] [oral] Official repository of Vision Test-Time Training☆76Apr 20, 2026Updated 2 weeks ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆41Oct 30, 2023Updated 2 years ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs (diffusion large language models) comprising base and instruct models, …☆84Mar 9, 2026Updated last month
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- Official implementation of Dynamic Perceiver☆43Nov 16, 2023Updated 2 years ago
- Official repo for "StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation"☆23Apr 22, 2026Updated 2 weeks ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆53Mar 20, 2025Updated last year
- MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence☆57Mar 11, 2026Updated last month
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆15Mar 6, 2025Updated last year
- Jittor implementation of Vision Transformer with Deformable Attention☆32Mar 1, 2022Updated 4 years ago
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…☆51Jan 5, 2026Updated 4 months ago
- Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos☆11Apr 26, 2026Updated last week
- Repository of GridMix (ICLR 2025)☆36Mar 18, 2025Updated last year
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆31Sep 30, 2024Updated last year
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated 2 years ago
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆47Sep 11, 2024Updated last year
- [TPAMI 2024] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition☆94Sep 30, 2024Updated last year
- [AAAI 2026 Oral] SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation☆62Jan 14, 2026Updated 3 months ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 7 months ago
- 3DSlicer plugin for inpainting lung nodules in 3D chest CT data.☆11Dec 2, 2024Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Jun 13, 2024Updated last year
- [Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide☆358Dec 31, 2025Updated 4 months ago
- Summary notes for the Software Construction course at Harbin Institute of Technology☆20Jul 16, 2018Updated 7 years ago
- The code repository of UniRL☆52May 30, 2025Updated 11 months ago
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆45Mar 27, 2026Updated last month
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Code for the ICML 2021 paper "iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients"☆10May 27, 2021Updated 4 years ago
- [NeurIPS 2024] Official repository of InLine attention☆60Dec 22, 2024Updated last year
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking☆30Mar 18, 2026Updated last month
- ☆73Mar 7, 2026Updated 2 months ago
- ☆16Sep 11, 2025Updated 7 months ago
- 🔥[MobiCom'25 Poster] AFL-Lib: An Asynchronous Federated Learning Library and Benchmark☆40Jul 23, 2025Updated 9 months ago
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆97Dec 27, 2025Updated 4 months ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Atte…☆20Apr 17, 2024Updated 2 years ago
- ☆24Sep 26, 2025Updated 7 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 7 months ago