ChengpengLi1003 / Q-learningLinks
针对最经典的表格型Q learning算法进行了复现,能够支持gym中大多数的离散动作和状态空间的环境,譬如CliffWalking-v0。
☆9Updated 4 years ago
Alternatives and similar repositories for Q-learning
Users that are interested in Q-learning are comparing it to the libraries listed below
Sorting:
- The code of paper Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic. Zhihai Wang, Jie Wang*, Qi Zhou, Bin…☆20Updated 3 years ago
- The code of paper *Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization*.☆19Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆18Updated last year
- ☆15Updated 7 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆47Updated last year
- A novel template-free retrosynthesizer that can generate diverse sets of reactants for a desired product via discrete conditional variati…☆15Updated 2 years ago
- [ICLR 2024] Official Implementation of ACORM☆48Updated last year
- EDIS: Energy-guided DIffusion Sampling☆15Updated 9 months ago
- This repo contains the code of "Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning".☆13Updated 3 years ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆42Updated last year
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆19Updated 6 months ago
- Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.☆24Updated last month
- ☆12Updated 2 years ago
- ☆21Updated 9 months ago
- [ICLR 2024] Tree-Planner: Efficient Close-loop Task Planning with Large Language Models☆17Updated last year
- LLM multi-agent discussion framework for multi-agent/robot situations.☆34Updated 8 months ago
- ☆78Updated last year
- Code accompanying the paper Graph Neural Network Guided Local Search for the Traveling Salesperson Problem☆28Updated 2 years ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆29Updated last year
- rl-papers☆47Updated 2 years ago
- Official Code for Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization (NIPS 2024)☆15Updated 9 months ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆30Updated 11 months ago
- ☆9Updated last year
- ☆60Updated last year
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆14Updated last year
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆29Updated last year
- Unofficial Supplementary Materials for Reinforcement Learning Course at CUHK: textbooks, slides, related papers, assignment, code ...☆27Updated 4 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning" (NeurIPS 2022 Oral)☆27Updated 2 years ago