A curated list of reinforcement learning with verifiable rewards (continually updated)
☆95Dec 15, 2025Updated 3 months ago
Alternatives and similar repositories for awesome-RLVR
Users that are interested in awesome-RLVR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆85Sep 28, 2025Updated 5 months ago
- A simple toolkit package for opendilab☆135Oct 14, 2025Updated 5 months ago
- Python cli and package interface for local and remote plantuml☆14Jun 26, 2025Updated 9 months ago
- 🚀 轻量视频🎥 大模型🤖☆22Apr 27, 2025Updated 10 months ago
- PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements. (e.g. MBTI Measurement Agent)☆199Aug 4, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆63Dec 12, 2023Updated 2 years ago
- ☆184Dec 26, 2022Updated 3 years ago
- 羊了个羊 + 深度强化学习(Deep Reinforcement Learning + 3 Tiles Game)☆496Mar 10, 2025Updated last year
- ☆80Jan 22, 2026Updated 2 months ago
- [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…☆27Jun 16, 2025Updated 9 months ago
- A curated list of awesome exploration RL resources (continually updated)☆657Dec 2, 2025Updated 3 months ago
- ☆10Jun 11, 2025Updated 9 months ago
- MiniWoB++: a web interaction benchmark for reinforcement learning☆12Apr 1, 2023Updated 2 years ago
- ☆12Mar 22, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆13Feb 15, 2025Updated last year
- ☆12Aug 15, 2024Updated last year
- a modern operating system (just support x86_64,aarch64)☆30Updated this week
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆19Aug 14, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆16Feb 22, 2025Updated last year
- ☆10Aug 19, 2023Updated 2 years ago
- Controllable, Reproducible, Evaluable Agent Platform☆43Updated this week
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Cross-platform virtual character immersive interaction engine☆49Updated this week
- [AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning☆305Jun 22, 2024Updated last year
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- (NeurIPS 2025) LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search☆22Mar 9, 2026Updated 2 weeks ago
- Code for Horizontal Federated Learning blog around Credit Scoring☆10Sep 14, 2020Updated 5 years ago
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Jan 28, 2023Updated 3 years ago
- can calculate the Hessian matrix and/or its spectrum for simple neural nets☆11May 7, 2018Updated 7 years ago
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆17Feb 27, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Edge-native web analytics on Cloudflare. High-throughput via Durable Objects, tiered D1/R2 storage for infinite retention, and privacy-fi…☆72Updated this week
- This repository is the collection of World model Papers☆57Updated this week
- ☆15Dec 2, 2019Updated 6 years ago
- Code from the CMU LM inference fall 2025 edition.☆35Dec 7, 2025Updated 3 months ago
- [CVPR 2024] SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction☆163Nov 10, 2024Updated last year
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 8 months ago
- ☆212Dec 23, 2025Updated 3 months ago