jidiai / SummerCourse2021
☆25Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for SummerCourse2021
- ☆97Updated 3 years ago
- ☆28Updated last year
- ☆158Updated last year
- ☆121Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆70Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆81Updated 4 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆89Updated 3 years ago
- ☆38Updated last month
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆82Updated last year
- Assignments for CS294-112.☆30Updated 5 years ago
- ☆117Updated 3 months ago
- Personal Repo to keep track of RL papers☆31Updated 3 years ago
- A collection of offline reinforcement learning algorithms.☆158Updated 5 months ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆27Updated 2 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆108Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆154Updated 2 years ago
- Inverse Constrained Reinforcement Learning (ICML 2021)☆18Updated 3 years ago
- ☆28Updated 2 years ago
- ☆71Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- Meta RL codebase for Unstable Baselines☆20Updated last year
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆37Updated 2 years ago
- ☆32Updated last year
- ☆17Updated 2 years ago
- ☆106Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆72Updated 10 months ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆24Updated 2 years ago
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆50Updated 4 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆38Updated 4 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year