KMnO4-zx / hand-on-rl
☆15Updated 6 months ago
Alternatives and similar repositories for hand-on-rl
Users that are interested in hand-on-rl are comparing it to the libraries listed below
- ☆73Updated 7 months ago
- An easier PyTorch deep reinforcement learning library.☆208Updated 4 months ago
- ☆207Updated this week
- NeurIPS 2024 DACER☆106Updated last week
- ☆81Updated 3 weeks ago
- ICLR 2025 Agent-Related Papers☆67Updated 6 months ago
- rl-papers☆47Updated 2 years ago
- 🎓 Automatically updates agent papers daily using GitHub Actions (updates every 12 hours)☆29Updated this week
- The mirror of RL_Coding_Exercise.☆83Updated 8 months ago
- llm & rl☆120Updated this week
- ☆40Updated 8 months ago
- ☆65Updated last year
- Build a bridge that connects beginners to deep reinforcement learning.☆11Updated 7 months ago
- Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...☆71Updated 2 weeks ago
- Notes on learning reinforcement learning through animations☆51Updated 2 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆113Updated last week
- ☆74Updated 9 months ago
- DPO training for Qwen (Tongyi Qianwen)☆47Updated 7 months ago
- The official implementation of Natural Language Fine-Tuning☆49Updated 4 months ago
- ☆60Updated last week
- ☆170Updated 2 months ago
- A comprehensive collection of process reward models.☆76Updated last week
- Tutorial4RL: Tutorial for Reinforcement Learning. An introductory reinforcement learning tutorial.☆144Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆39Updated last month
- A curated list of visual reinforcement learning resources☆265Updated 2 weeks ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆112Updated last week
- Not interactive deep reinforcement learning book with no-framework code, copied math, no discussions. Adopted at only -1 university(Shanh…☆23Updated 9 months ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆22Updated 5 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆35Updated last month
- ☆37Updated last month