SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
☆64Dec 1, 2025Updated 5 months ago
Alternatives and similar repositories for Spec-RL
Users that are interested in Spec-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆32Jan 27, 2026Updated 3 months ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆17Oct 17, 2024Updated last year
- 🦦 Crayotter: A Multimodal AI-Agent for Video-Editing, Video-Composing, and Video Production. Powered by Multimodal LLMs for autonomous T…☆61Apr 21, 2026Updated last week
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 11 months ago
- The codes of our paper "ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"☆14Jun 29, 2025Updated 10 months ago
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity☆22Jan 13, 2023Updated 3 years ago
- Multilingual and Multiculture Benchmark and LLM☆34Apr 23, 2026Updated last week
- 哈工大威海自动评教脚本☆12Feb 4, 2024Updated 2 years ago
- Resilient fork of OpenClaw Browser Relay extension — auto-reconnect, state persistence, keepalive☆27Feb 21, 2026Updated 2 months ago
- Transformers components but in Triton☆34May 9, 2025Updated 11 months ago
- ☆35Jun 28, 2025Updated 10 months ago
- [AAAI 2026] AD-L-JEPA: Self-Supervised Representation Learning with Joint Embedding Predictive Architecture for Automotive LiDAR Object D…☆38Nov 18, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆48Sep 15, 2025Updated 7 months ago
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆13Aug 2, 2024Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆53Dec 13, 2025Updated 4 months ago
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Apr 10, 2024Updated 2 years ago
- This is a modified sthlm-beamer template optimized for chinese. Maintained by Hongxing Xia☆15Dec 12, 2015Updated 10 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 6 months ago
- ☆24Mar 26, 2025Updated last year
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆96Mar 14, 2026Updated last month
- [ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"☆31Jul 20, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of our IWSLT 2023 paper "The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Tra…☆16Jul 14, 2023Updated 2 years ago
- 🌈 The Bangumi extension for VSCode. Her data source came from Bilibili. [Maintenance phase]☆12Oct 7, 2023Updated 2 years ago
- 基于AnimeGAN2+serverless+NAS存储的漫画风图片生成工具(demo 已失效)☆12May 11, 2022Updated 3 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆15Jan 16, 2026Updated 3 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆50Mar 31, 2026Updated last month
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆41May 30, 2025Updated 11 months ago
- ☆24May 21, 2025Updated 11 months ago
- Open-ended wargames with large language models☆52Feb 11, 2026Updated 2 months ago
- Demand Forecasting is the process in which historical sales data is used to develop an estimate of an expected forecast of customer deman…☆12Jul 13, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Real-time image and video foveation transform using PyCUDA☆11Jan 6, 2021Updated 5 years ago
- The missing layer between idea and code.☆28Feb 5, 2026Updated 3 months ago
- [Ebook]从零到百万店铺:一个没有计算机学位的普通人的系统设计实战之旅☆27Nov 11, 2025Updated 5 months ago
- Bayes-Adaptive RL for LLM Reasoning☆46May 28, 2025Updated 11 months ago
- implementation of xv6 labs from MIT 6.S081 2020☆12Oct 2, 2022Updated 3 years ago
- [ICLR 2026 🔥] Official pytorch implementation for "Attention Is All You Need for KV Cache in Diffusion LLMs"☆40Jan 23, 2026Updated 3 months ago
- ☆36Oct 3, 2018Updated 7 years ago