SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
☆63 · Dec 1, 2025 · Updated 3 months ago
Alternatives and similar repositories for Spec-RL
Users who are interested in Spec-RL are comparing it to the libraries listed below.
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation ☆10 · May 19, 2025 · Updated 9 months ago
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh… ☆19 · Dec 30, 2025 · Updated 2 months ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co… ☆13 · Jan 16, 2026 · Updated last month
- Transformers components but in Triton ☆34 · May 9, 2025 · Updated 9 months ago
- ☆36 · Oct 3, 2018 · Updated 7 years ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop ☆16 · Feb 19, 2026 · Updated last week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025 ☆23 · Nov 13, 2025 · Updated 3 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory. ☆29 · Updated this week
- ☆12 · Jan 15, 2015 · Updated 11 years ago
- Auction Theory Toolbox – Computer Verified Auctions ☆14 · Jul 12, 2016 · Updated 9 years ago
- Fast, free, easy, and object-agnostic video anonymization ☆11 · Dec 12, 2020 · Updated 5 years ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations. ☆26 · Updated this week
- ☆11 · Jan 25, 2021 · Updated 5 years ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding ☆30 · Jan 27, 2026 · Updated last month
- ☆24 · Dec 19, 2025 · Updated 2 months ago
- ☆31 · Feb 3, 2026 · Updated last month
- ☆13 · Oct 21, 2024 · Updated last year
- ☆10 · Jun 14, 2024 · Updated last year
- MCP server for Grok AI API integration ☆21 · Jun 2, 2025 · Updated 9 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 · Dec 30, 2025 · Updated 2 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based … ☆11 · Mar 18, 2023 · Updated 2 years ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca. ☆34 · Dec 16, 2025 · Updated 2 months ago
- A game engine made in Java using libgdx (Currently in alpha state, and probably will remain that way) ☆16 · Jan 4, 2012 · Updated 14 years ago
- It shows an intelligent agent based on LangGraph for long form writing. ☆12 · Mar 1, 2025 · Updated last year
- [AAAI 2026] AutoTool: Efficient Tool Selection for Large Language Model Agents ☆29 · Dec 28, 2025 · Updated 2 months ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D… ☆11 · Feb 14, 2023 · Updated 3 years ago
- triton ver of gqa flash attn, based on the tutorial ☆12 · Aug 4, 2024 · Updated last year
- The open-source language model computer ☆10 · Mar 22, 2024 · Updated last year
- The Official API of Array of Things ☆10 · Dec 3, 2022 · Updated 3 years ago
- Struct-aware fuzzing framework + some fuzzers ☆30 · Jan 28, 2026 · Updated last month
- ☆26 · Updated this week
- UnicEdit-10M and UnicBench project ☆23 · Updated this week
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search ☆17 · Jan 24, 2026 · Updated last month
- Dataflow-MM, multi-media operators for Dataflow. We aim to prepare data for Multimodal Large Language Models. ☆31 · Feb 25, 2026 · Updated last week
- Large language models to diffusion finetuning code ☆24 · Jun 2, 2025 · Updated 9 months ago
- ☆39 · Oct 29, 2025 · Updated 4 months ago
- The Python solutions of leetcode ☆13 · Apr 26, 2020 · Updated 5 years ago
- Get aid from local LLMs right in your PowerShell ☆15 · May 2, 2025 · Updated 10 months ago
- Metadata browser of TREC ☆10 · Updated this week