A curated list of reinforcement learning with verifiable rewards (continually updated)
☆111Dec 15, 2025Updated 4 months ago
Alternatives and similar repositories for awesome-RLVR
Users that are interested in awesome-RLVR are comparing it to the libraries listed below.
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆35Apr 27, 2025Updated 11 months ago
- A simple toolkit package for opendilab☆139Oct 14, 2025Updated 6 months ago
- Decision Intelligence Adventure for Beginners☆99Dec 9, 2022Updated 3 years ago
- Python CLI and package interface for local and remote PlantUML☆14Jun 26, 2025Updated 9 months ago
- OpenDILab RL HPC OP Lib, including CUDA and Triton kernel☆251Jul 4, 2024Updated last year
- A curated list of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)☆288Dec 15, 2025Updated 4 months ago
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆64Dec 12, 2023Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆199Feb 18, 2025Updated last year
- OpenDILab RL Kubernetes Custom Resource and Operator Lib☆264Jan 9, 2023Updated 3 years ago
- CodeMorpheus: Generate code self-portraits with one click (decision-making AI + generative AI)☆58Jan 8, 2024Updated 2 years ago
- ☆186Dec 26, 2022Updated 3 years ago
- ☆87Jan 22, 2026Updated 2 months ago
- [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…☆27Jun 16, 2025Updated 10 months ago
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- A curated list of awesome exploration RL resources (continually updated)☆666Dec 2, 2025Updated 4 months ago
- MiniWoB++: a web interaction benchmark for reinforcement learning☆12Apr 1, 2023Updated 3 years ago
- ☆12May 5, 2023Updated 2 years ago
- The code for the paper, 'Meta-Curvature, Eunbyung Park and Junier Oliver, NeurIPS 2019'☆11Jan 20, 2020Updated 6 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- From a single topic to a multi-minute video, LingtiStudio can take you through: script -> review -> keyframes -> voiceover -> clips ->…☆69Apr 9, 2026Updated last week
- Code for the paper "Knowledge-Aware Federated Active Learning with Non-IID Data", ICCV2023☆10Sep 8, 2023Updated 2 years ago
- Decision Intelligence platform for Biological Sequence Searching☆140Oct 10, 2022Updated 3 years ago
- ☆10Aug 19, 2023Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting☆21Feb 10, 2025Updated last year
- Building an open-ended embodied agent in a battle royale FPS game☆39Feb 6, 2024Updated 2 years ago
- [AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning☆310Jun 22, 2024Updated last year
- OpenDILab RL Object Store☆188Apr 20, 2022Updated 3 years ago
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Jan 28, 2023Updated 3 years ago
- (NeurIPS 2025) LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search☆23Mar 9, 2026Updated last month
- ☆12Apr 18, 2023Updated 2 years ago
- Ultra-minimal AI chat UI: 30s deploy, no sign-up; OpenAI-compatible; RAG + vision + web parsing; plugins/adapters.☆58Feb 21, 2026Updated last month
- Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition (COLING2024)☆17Jun 18, 2025Updated 9 months ago
- OO course rules documentation☆10Mar 29, 2019Updated 7 years ago
- Federated Fairness-aware Recommendation☆14Sep 2, 2022Updated 3 years ago
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 9 months ago
- ☆214Dec 23, 2025Updated 3 months ago
- This repo supports automatic line plots for multi-seed event files from TensorBoard☆12Jun 23, 2022Updated 3 years ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Jul 27, 2025Updated 8 months ago