A curated list of reinforcement learning with verifiable rewards (continually updated)
☆135Dec 15, 2025Updated 4 months ago
Alternatives and similar repositories for awesome-RLVR
Users that are interested in awesome-RLVR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆40Apr 27, 2025Updated last year
- A simple toolkit package for opendilab☆144Oct 14, 2025Updated 6 months ago
- OpenDILab RL HPC OP Lib, including CUDA and Triton kernel☆256Jul 4, 2024Updated last year
- A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)☆294Dec 15, 2025Updated 4 months ago
- Decision Intelligence Adventure for Beginners☆104Dec 9, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Python cli and package interface for local and remote plantuml☆14Jun 26, 2025Updated 10 months ago
- 🚀 轻量视频🎥 大模型🤖☆22Apr 27, 2025Updated last year
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆65Dec 12, 2023Updated 2 years ago
- OpenDILab RL Kubernetes Custom Resource and Operator Lib☆270Jan 9, 2023Updated 3 years ago
- LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework☆317Apr 29, 2026Updated last week
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆61Jan 8, 2024Updated 2 years ago
- [ACL 2026] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning☆89Jan 22, 2026Updated 3 months ago
- [CVPR 2024] SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction☆168Nov 10, 2024Updated last year
- 1024 + 深度强化学习(Deep Reinforcement Learning + 1024 Game/ 2048 Game)☆150Jul 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- A curated list of awesome exploration RL resources (continually updated)☆678Dec 2, 2025Updated 5 months ago
- Open-Source Reproduction/Demo of the LLM Riddles Game☆577Jul 30, 2024Updated last year
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆11Nov 15, 2024Updated last year
- A curated list of Multi-Modal Reinforcement Learning resources (continually updated)☆604Dec 15, 2025Updated 4 months ago
- MiniWoB++: a web interaction benchmark for reinforcement learning☆12Apr 1, 2023Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆28Jan 14, 2025Updated last year
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆13Feb 15, 2025Updated last year
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆15Feb 15, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,577Updated this week
- Mental state inference from observable behavior☆15Dec 3, 2021Updated 4 years ago
- BUAA Compiler Homework☆13Mar 12, 2018Updated 8 years ago
- Decision Intelligence platform for Traffic Crossing Signal Control☆255Mar 22, 2023Updated 3 years ago
- Pretrained model 1024x1024 trained on 1970s scifi art☆16Jul 5, 2023Updated 2 years ago
- Datasets for Causal-Structure-Learning Repo☆15Apr 22, 2020Updated 6 years ago
- [CVPR'2025] Learning Bijective Surface Parameterization for Inferring Signed Distance Functions from Sparse Point Clouds with Grid Deform…☆79Dec 1, 2025Updated 5 months ago
- Here are the most awesome tree structure computing solutions, make your life easier. (这里有目前 性能最优的树形结构计算解决方案)☆260Oct 17, 2024Updated last year
- 一个面向初学者的 Flutter 示例项目,展示基础控件、布局和样式。适合学习 Flutter 基础知识并快速上手开发简单应用。☆28Nov 5, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Apr 2, 2022Updated 4 years ago
- ☆10Aug 19, 2023Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting☆21Feb 10, 2025Updated last year
- [AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning☆315Jun 22, 2024Updated last year
- OpenDILab RL Object Store☆192Apr 20, 2022Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 3 years ago