A curated list of reinforcement learning with verifiable rewards (continually updated)
☆74Dec 15, 2025Updated 2 months ago
Alternatives and similar repositories for awesome-RLVR
Users that are interested in awesome-RLVR are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆26Apr 27, 2025Updated 10 months ago
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆78Sep 28, 2025Updated 5 months ago
- [ICCV 2025] Pretrained Reversible Generation as Unsupervised Visual Representation Learning☆25Nov 5, 2025Updated 4 months ago
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆58Dec 12, 2023Updated 2 years ago
- DI-engine docs (Chinese and English)☆321Mar 10, 2025Updated 11 months ago
- [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…☆27Jun 16, 2025Updated 8 months ago
- ☆78Jan 22, 2026Updated last month
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆56Jan 8, 2024Updated 2 years ago
- ☆10Jun 24, 2020Updated 5 years ago
- A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)☆276Dec 15, 2025Updated 2 months ago
- Building open-ended embodied agent in battle royale FPS game☆38Feb 6, 2024Updated 2 years ago
- This is the repository of the EnviroDetaNet☆13Sep 3, 2024Updated last year
- Active Learning for SN photometric classification☆10Oct 10, 2025Updated 4 months ago
- Policy Optimization is awesome, let’s put a tree on it! 🌲🌟☆22Jul 4, 2025Updated 8 months ago
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆10Nov 15, 2024Updated last year
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆188Feb 18, 2025Updated last year
- ☆13Apr 2, 2018Updated 7 years ago
- Solution for N+1 fish, N+2 fish DrivenData competition (2nd place)☆13Sep 12, 2019Updated 6 years ago
- Predicting treatment effects from RCTs (Circulation: CQO 2019).☆10Jun 21, 2022Updated 3 years ago
- privacy☆12Nov 30, 2018Updated 7 years ago
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 7 months ago
- Cheminformatic analysis of small molecule type drugs in DrugBank for their ability to form nanoparticles with indocyanine dyes.☆11Apr 30, 2018Updated 7 years ago
- Solve ciphers with python☆10Oct 24, 2018Updated 7 years ago
- synchronous and asynchronous event based c++ executor libray☆13Sep 25, 2016Updated 9 years ago
- ☆11Apr 22, 2018Updated 7 years ago
- Visualizes search engine ranking algorithms for a given domain☆30Dec 13, 2010Updated 15 years ago
- Code from the CMU LM inference fall 2025 edition.☆34Dec 7, 2025Updated 2 months ago
- The code for the paper, 'Meta-Curvature, Eunbyung Park and Junier Oliver, NeurIPS 2019'☆11Jan 20, 2020Updated 6 years ago
- ☆10Mar 1, 2022Updated 4 years ago
- 原神七圣召唤模拟环境 Simulator of Genius Invocation☆49Apr 29, 2024Updated last year
- ☆10Aug 19, 2023Updated 2 years ago
- ☆15Jul 25, 2024Updated last year
- Tensorflow binaries and Docker images compiled with GPU support and CPU optimizations.☆14Jul 30, 2019Updated 6 years ago
- Tutorials, Examples about Kubeflow Pipeline.☆13Nov 21, 2022Updated 3 years ago
- Kaggling Home Credit Default Risk in a pipeline fashion.☆12Sep 20, 2018Updated 7 years ago
- ☆20Dec 16, 2025Updated 2 months ago
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 4 months ago
- a sample REST API in Django and Python for employees. Supports GET, POST, PUT, DEL☆10Aug 22, 2018Updated 7 years ago
- A curated list of awesome exploration RL resources (continually updated)☆645Dec 2, 2025Updated 3 months ago