A curated list of reinforcement learning with verifiable rewards (continually updated)
☆161Dec 15, 2025Updated 5 months ago
Alternatives and similar repositories for awesome-RLVR
Users that are interested in awesome-RLVR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆42Apr 27, 2025Updated last year
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆97Sep 28, 2025Updated 7 months ago
- [NeurIPS 2025 AI for Music Workshop] Vocal Reaction Model and Benchmark☆37Dec 10, 2025Updated 5 months ago
- A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)☆300Updated this week
- Decision Intelligence Adventure for Beginners☆106Dec 9, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python cli and package interface for local and remote plantuml☆14Jun 26, 2025Updated 11 months ago
- 🚀 轻量视频🎥 大模型🤖☆22Apr 27, 2025Updated last year
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆66Dec 12, 2023Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆209Feb 18, 2025Updated last year
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆61Jan 8, 2024Updated 2 years ago
- ☆191Dec 26, 2022Updated 3 years ago
- 羊了个羊 + 深度强化学习(Deep Reinforcement Learning + 3 Tiles Game)☆507Mar 10, 2025Updated last year
- [ACL 2026] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning☆92Jan 22, 2026Updated 4 months ago
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A curated list of awesome exploration RL resources (continually updated)☆688Dec 2, 2025Updated 5 months ago
- Open-Source Reproduction/Demo of the LLM Riddles Game☆579Jul 30, 2024Updated last year
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆11Nov 15, 2024Updated last year
- A curated list of Multi-Modal Reinforcement Learning resources (continually updated)☆607Dec 15, 2025Updated 5 months ago
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆13Feb 15, 2025Updated last year
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆28Jan 14, 2025Updated last year
- The code for the paper, 'Meta-Curvature, Eunbyung Park and Junier Oliver, NeurIPS 2019'☆11Jan 20, 2020Updated 6 years ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,591May 12, 2026Updated 2 weeks ago
- [CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer☆646Jan 4, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- MINT, Multiplier-less INTeger Quantization for Energy Efficient Spiking Neural Networks, ASP-DAC 2024, Nominated for Best Paper Award☆16Apr 12, 2024Updated 2 years ago
- [CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models☆904Apr 14, 2025Updated last year
- 一个面向初学者的 Flutter 示例项目,展示基础控件、布局和样式。适合学习 Flutter 基础知识并快速上手开发简单应用。☆26Nov 5, 2025Updated 6 months ago
- Decision Intelligence platform for Biological Sequence Searching☆146Oct 10, 2022Updated 3 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆16Feb 22, 2025Updated last year
- ☆11Apr 2, 2022Updated 4 years ago
- PSYCH 291: Causal Cognition (https://tobiasgerstenberg.github.io/causal_cognition/)☆13May 23, 2019Updated 7 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR'2025] Learning Bijective Surface Parameterization for Inferring Signed Distance Functions from Sparse Point Clouds with Grid Deform…☆92Dec 1, 2025Updated 5 months ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- Named Entity Recognition via Attention_based CNNs-BiLSTm-CRF☆15Jun 27, 2018Updated 7 years ago
- Building open-ended embodied agent in battle royale FPS game☆39Feb 6, 2024Updated 2 years ago
- [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".☆254Dec 7, 2022Updated 3 years ago
- [AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning☆313Jun 22, 2024Updated last year
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago