ChengpengLi1003 / Q-learning
A reimplementation of the classic tabular Q-learning algorithm. It supports most gym environments with discrete action and state spaces, such as CliffWalking-v0.
☆10Updated 5 years ago
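The repository's own code is not reproduced on this page. As a rough illustration of what tabular Q-learning on a discrete state and action space involves, the sketch below trains a Q-table with epsilon-greedy exploration. Everything in it is an assumption made for illustration: the function names `q_learning` and `chain_step`, the hyperparameter defaults, and the five-state chain environment, which stands in for a gym environment such as CliffWalking-v0 so that the example runs with the standard library alone.

```python
import random

def q_learning(n_states, n_actions, step, start_state, episodes=500,
               alpha=0.5, gamma=0.9, epsilon=0.1, max_steps=100, seed=0):
    """Tabular Q-learning sketch; `step(state, action)` is a hypothetical
    callback returning (next_state, reward, done)."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        state = start_state
        for _ in range(max_steps):
            # Epsilon-greedy action selection over the current Q-table row.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                action = max(range(n_actions), key=lambda a: q[state][a])
            next_state, reward, done = step(state, action)
            # Bootstrap from the greedy value of the next state (zero if terminal).
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q

def chain_step(state, action, n_states=5):
    """Hypothetical toy environment (not from the repository):
    action 1 moves right, action 0 moves left, the last state is terminal."""
    next_state = min(state + 1, n_states - 1) if action == 1 else max(state - 1, 0)
    done = next_state == n_states - 1
    reward = 0.0 if done else -1.0  # each non-terminal step costs 1
    return next_state, reward, done

q_table = q_learning(n_states=5, n_actions=2, step=chain_step, start_state=0)
# Greedy action per non-terminal state, read off the learned Q-table.
greedy_policy = [max(range(2), key=lambda a: q_table[s][a]) for s in range(4)]
```

To try the same loop on a gym environment, the `step` callback would need to wrap the environment's own step call and return the same three values; the exact signature depends on the gym version in use.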
Alternatives and similar repositories for Q-learning
Users who are interested in Q-learning are comparing it to the repositories listed below
- LLM multi-agent discussion framework for multi-agent/robot situations.☆40Updated last year
- ☆81Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆24Updated 4 months ago
- NeurIPS 2024 DACER☆166Updated 3 weeks ago
- Official code release of AAAI 2024 paper SayCanPay.☆53Updated 3 months ago
- Enhancing LLM/VLM capability for robot task and motion planning with extra algorithm based tools.☆74Updated last year
- HiCRISP Full Code, containing VirtualHome, pybullet simulator and Real AGV platform.☆15Updated last year
- [ICML 2024] Learning Reward for Robot Skills Using Large Language Models via Self-Alignment☆18Updated last year
- Robot Learning Algorithms☆26Updated last year
- EDIS: Energy-guided DIffusion Sampling☆18Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- rl-papers☆50Updated 2 years ago
- Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL☆18Updated 2 years ago
- Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections"☆48Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"☆151Updated last year
- ☆19Updated last year
- Code repository for SMART-LLM: Smart Multi-Agent Robot Task Planning using Large Language Models☆179Updated last year
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆22Updated 2 years ago
- [ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆79Updated 2 years ago
- ☆25Updated last year
- Panning for gold in 💩☆31Updated 2 months ago
- ☆44Updated last month
- official implementation of QVPO☆60Updated 2 weeks ago
- ☆21Updated last year
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆35Updated last year
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments.☆11Updated last year
- ☆71Updated 7 months ago
- ☆90Updated 3 years ago
- basic theory and code of RL.☆49Updated 2 years ago