zhshao17 / Discovery-of-Optimal-Reward-functionLinks
Official implementation of the paper "Discovery of the Reward Function for Embodied RL Agents".
☆38Updated 2 months ago
Alternatives and similar repositories for Discovery-of-Optimal-Reward-function
Users that are interested in Discovery-of-Optimal-Reward-function are comparing it to the libraries listed below
Sorting:
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆99Updated 6 months ago
- ☆55Updated 6 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆90Updated 8 months ago
- NeurIPS 2024 DACER☆152Updated 2 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆427Updated 3 weeks ago
- PPO, DDPG, SAC implementation on mujoco environment☆122Updated 3 years ago
- ☆106Updated 5 months ago
- ☆101Updated 5 months ago
- ☆118Updated 2 years ago
- ☆413Updated last year
- Official Github Repository for "Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints". (NeurIPS 2023)☆20Updated 3 weeks ago
- General Optimal control Problem Solver (GOPS), an easy-to-use PyTorch reinforcement learning solver package for industrial control.☆285Updated 2 months ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆173Updated 3 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆60Updated 3 years ago
- Implementation of PPO Lagrangian in PyTorch☆53Updated 3 years ago
- ☆33Updated 7 months ago
- (ICML 2024) The official code for EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search☆33Updated last year
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning…☆52Updated 7 months ago
- 深度强化学习各算法介绍与Pytorch实现☆74Updated last year
- Source Code☆219Updated last year
- A clean and robust Pytorch implementation of SAC on discrete action space☆41Updated last year
- ☆46Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆95Updated 2 years ago
- NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms☆388Updated last year
- DSAC; Distributional Soft Actor-Critic☆135Updated 10 months ago
- ☆70Updated 5 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆169Updated last year
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆37Updated last year
- A Reinforcement Learning Project using PPO + LSTM☆102Updated 2 years ago