SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
☆65Dec 1, 2025Updated 7 months ago
Alternatives and similar repositories for Spec-RL
Users that are interested in Spec-RL are comparing it to the libraries listed below.
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…☆22Dec 30, 2025Updated 6 months ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆34Jan 27, 2026Updated 5 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆21Apr 9, 2025Updated last year
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 6 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated last year
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity☆22Jan 13, 2023Updated 3 years ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆26Nov 25, 2025Updated 7 months ago
- Multilingual and Multiculture Benchmark and LLM☆42May 18, 2026Updated last month
- ☆17Mar 8, 2021Updated 5 years ago
- Automatic course-evaluation script for Harbin Institute of Technology, Weihai☆12Feb 4, 2024Updated 2 years ago
- Transformers components but in Triton☆34May 9, 2025Updated last year
- ☆13Jan 14, 2025Updated last year
- surface remeshing with field-aligned CVT (code of Eurographics 2018 paper "Field-Aligned Isotropic Surface Remeshing")☆12Jul 5, 2018Updated 7 years ago
- ☆40Jun 28, 2025Updated last year
- A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.☆10Nov 6, 2021Updated 4 years ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆115Apr 23, 2026Updated 2 months ago
- This is a modified sthlm-beamer template optimized for chinese. Maintained by Hongxing Xia☆15Dec 12, 2015Updated 10 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 8 months ago
- ☆25Jun 1, 2026Updated last month
- An effort to benchmark Arabic legal reasoning in foundation models.☆19May 21, 2025Updated last year
- Official implementation of our IWSLT 2023 paper "The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Tra…☆16Jul 14, 2023Updated 2 years ago
- ☆37Jul 21, 2025Updated 11 months ago
- [ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"☆34Jul 20, 2025Updated 11 months ago
- Voronoi-Based Foveated Volume Rendering☆10Sep 30, 2021Updated 4 years ago
- 🌈 The Bangumi extension for VSCode. Her data source came from Bilibili. [Maintenance phase]☆12Oct 7, 2023Updated 2 years ago
- Anime-style image generation tool based on AnimeGAN2 + serverless + NAS storage (demo no longer available)☆12May 11, 2022Updated 4 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆16Jan 16, 2026Updated 5 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 3 months ago
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆121Mar 14, 2026Updated 3 months ago
- Homepage for the Data Interaction Group at CMU☆13Jun 23, 2026Updated last week
- nv-one-logger enables tracking of GPU application progress over time and can help to identify overhead from workload and cluster ineffici…☆23Nov 6, 2025Updated 7 months ago
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆41May 30, 2025Updated last year
- ☆23May 21, 2025Updated last year
- Real-time image and video foveation transform using PyCUDA