zhshao17 / Discovery-of-Optimal-Reward-functionLinks
Official implementation of the paper "Discovery of the Reward Function for Embodied RL Agents".
☆85Updated 3 months ago
Alternatives and similar repositories for Discovery-of-Optimal-Reward-function
Users that are interested in Discovery-of-Optimal-Reward-function are comparing it to the libraries listed below
Sorting:
- a clean and robust Pytorch implementation of SAC on continuous action space☆89Updated 9 months ago
- ☆55Updated 7 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆431Updated last month
- A plotter for reinforcement learning (RL) using Weights & Biases☆15Updated 2 years ago
- ☆106Updated 6 months ago
- 深度强化学习各算法介绍与Pytorch实现☆74Updated last year
- General Optimal control Problem Solver (GOPS), an easy-to-use PyTorch reinforcement learning solver package for industrial control.☆289Updated 3 months ago
- A Reinforcement Learning Project using PPO + LSTM☆109Updated 2 years ago
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation☆141Updated 6 months ago
- NeurIPS 2024 DACER☆160Updated 3 months ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆101Updated 6 months ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆42Updated last year
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆61Updated 3 years ago
- (ICML 2024) The official code for EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search☆34Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆96Updated 2 years ago
- ☆48Updated 3 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆163Updated 2 years ago
- ☆101Updated 6 months ago
- ☆32Updated 2 years ago
- Hybrid Action PPO in stable-baselines3☆15Updated last year
- Official Github Repository for "Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints". (NeurIPS 2023)☆20Updated last month
- ☆35Updated 8 months ago
- Source Code☆222Updated last year
- PPO, DDPG, SAC implementation on mujoco environment☆124Updated 3 years ago
- ☆118Updated 2 years ago
- 用于教学的RL算法仓库,里面放置各种算法的最简单实现,目的是快速理解某个算法☆47Updated 7 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆169Updated last year
- Implementation of PPO Lagrangian in PyTorch☆54Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆218Updated last year
- TD3 in Pytorch☆35Updated 4 years ago