KMnO4-zx / hand-on-rl
☆13Updated 4 months ago
Alternatives and similar repositories for hand-on-rl:
Users that are interested in hand-on-rl are comparing it to the libraries listed below
- ☆50Updated 5 months ago
- ☆128Updated last month
- Build a bridge that connects beginners to deep reinforcement learning.☆9Updated 6 months ago
- NeurIPS 2024 DACER☆91Updated last month
- 本项目将基于多模态,RAG以及LLM等技术,打造了一个基于手相算命的系统☆25Updated 7 months ago
- ☆74Updated 4 months ago
- An easier PyTorch deep reinforcement learning library.☆192Updated 3 months ago
- 🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)☆29Updated this week
- rl-papers☆47Updated 2 years ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆362Updated 11 months ago
- ☆30Updated 7 months ago
- ICLR 2025 Agent-Related Papers☆57Updated 4 months ago
- SOTA RL fine-tuning solution for advanced math reasoning of LLM☆91Updated this week
- llm & rl☆73Updated this week
- ☆59Updated last month
- ☆18Updated 7 months ago
- LLM multi-agent discussion framework for multi-agent/robot situations.☆32Updated 6 months ago
- Robot Learning Algorithms☆25Updated 7 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆29Updated last week
- ☆39Updated last year
- pytorch distribute tutorials☆117Updated last month
- The Role Playing Project of Honor-of-Kings Based on LnternLM2。峡谷小狐仙--王者荣耀领域的角色扮演聊天机器人,结合多模态技术将英雄妲己的形象带入大模型中。☆24Updated 8 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆260Updated 4 months ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆20Updated 3 months ago
- ☆57Updated 7 months ago
- ☆296Updated last month
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆85Updated 6 months ago
- ☆24Updated 2 months ago
- The official implementation of Natural Language Fine-Tuning☆47Updated 2 months ago
- The mirror of RL_Coding_Exercise.☆80Updated 6 months ago