KMnO4-zx / hand-on-rlLinks
☆15Updated 7 months ago
Alternatives and similar repositories for hand-on-rl
Users that are interested in hand-on-rl are comparing it to the libraries listed below
Sorting:
- ☆78Updated 8 months ago
- ☆217Updated 2 weeks ago
- LLM, RL, DPO, SFT, Distillation, Alignment. 由《大模型算法》作者发起(By the author of the book📘 "Large Model Algorithms")☆44Updated 2 weeks ago
- ICLR 2025 Agent-Related Papers☆71Updated 6 months ago
- Build a bridge that connects beginners to deep reinforcement learning.☆11Updated 8 months ago
- A curated list of RL resources☆40Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆123Updated this week
- An easier PyTorch deep reinforcement learning library.☆224Updated 5 months ago
- llm & rl☆139Updated this week
- 通过动画学强化学习笔记☆53Updated 3 months ago
- ☆83Updated last month
- ☆76Updated 9 months ago
- NeurIPS 2024 DACER☆113Updated last week
- 🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)☆30Updated this week
- 全网最全-2025年AI领域最值得关注的两百位博主和一手信息源盘点☆104Updated 4 months ago
- rl-papers☆47Updated 2 years ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆82Updated 2 months ago
- Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.☆146Updated last year
- pytorch distribute tutorials☆136Updated 2 weeks ago
- ☆38Updated 2 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆35Updated last week
- ☆51Updated last year
- The official implementation of Natural Language Fine-Tuning☆50Updated 5 months ago
- ☆18Updated 9 months ago
- 本项目将基于多模态,RAG以及LLM等技术,打造了一个基于手相算命的系统☆27Updated 9 months ago
- Not interactive deep reinforcement learning book with no-framework code, copied math, no discussions. Adopted at only -1 university(Shanh…☆23Updated 9 months ago
- ☆40Updated 9 months ago
- Run TRex with PPO☆38Updated 3 weeks ago
- LLM multi-agent discussion framework for multi-agent/robot situations.☆34Updated 8 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆121Updated this week