opendilab / awesome-RLVRView external linksLinks
A curated list of reinforcement learning with verifiable rewards (continually updated)
☆68Dec 15, 2025Updated last month
Alternatives and similar repositories for awesome-RLVR
Users that are interested in awesome-RLVR are comparing it to the libraries listed below
Sorting:
- Python cli and package interface for local and remote plantuml☆14Jun 26, 2025Updated 7 months ago
- OpenDILab RL HPC OP Lib, including CUDA and Triton kernel☆240Jul 4, 2024Updated last year
- [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…☆27Jun 16, 2025Updated 7 months ago
- ☆10Jun 24, 2020Updated 5 years ago
- Building open-ended embodied agent in battle royale FPS game☆38Feb 6, 2024Updated 2 years ago
- Active Learning for SN photometric classification☆10Oct 10, 2025Updated 4 months ago
- ☆18Sep 20, 2025Updated 4 months ago
- ☆12Aug 15, 2024Updated last year
- synchronous and asynchronous event based c++ executor libray☆13Sep 25, 2016Updated 9 years ago
- The official repository of MM-R5☆28Jun 22, 2025Updated 7 months ago
- ☆13Apr 2, 2018Updated 7 years ago
- Jupyter Notebooks and other code for 4CE data visualizations.☆13Jan 25, 2023Updated 3 years ago
- Demo code for Gemini Live Integration☆14Jul 29, 2025Updated 6 months ago
- Tools to expand Python's enum module.☆10Jan 21, 2026Updated 3 weeks ago
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 7 months ago
- ☆12Nov 16, 2022Updated 3 years ago
- Predicting treatment effects from RCTs (Circulation: CQO 2019).☆10Jun 21, 2022Updated 3 years ago
- Solution for N+1 fish, N+2 fish DrivenData competition (2nd place)☆13Sep 12, 2019Updated 6 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated 11 months ago
- OpenDILab RL Kubernetes Custom Resource and Operator Lib☆253Jan 9, 2023Updated 3 years ago
- The code for the paper, 'Meta-Curvature, Eunbyung Park and Junier Oliver, NeurIPS 2019'☆11Jan 20, 2020Updated 6 years ago
- Tutorials, Examples about Kubeflow Pipeline.☆13Nov 21, 2022Updated 3 years ago
- ☆14Jul 25, 2024Updated last year
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- a sample REST API in Django and Python for employees. Supports GET, POST, PUT, DEL☆10Aug 22, 2018Updated 7 years ago
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 3 months ago
- Recruit Restaurant Visitor Forecasting 25th place solution☆12Feb 19, 2018Updated 7 years ago
- ☆11Apr 22, 2018Updated 7 years ago
- 原神七圣召唤模拟环境 Simulator of Genius Invocation☆49Apr 29, 2024Updated last year
- ☆12Apr 18, 2023Updated 2 years ago
- A curated list of awesome exploration RL resources (continually updated)☆640Dec 2, 2025Updated 2 months ago
- A CUDA implementation of the ZeroOut tensorflow custom op, just for fun☆11Feb 1, 2017Updated 9 years ago
- A collection of deep reinforcement learning-based & GFlowNet drug molecule generators focused on generation of molecules using Graphs/SEL…☆10Dec 11, 2022Updated 3 years ago
- ☆14Nov 13, 2024Updated last year
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated last year
- Training of a Word2Vec Model on a Text Dataset☆12Sep 12, 2017Updated 8 years ago
- Frequently updated list of dLLM (Diffusion Large Language Models) papers, models, and other resources☆22Jan 30, 2026Updated 2 weeks ago
- ☆13May 8, 2019Updated 6 years ago