deep-reinforcement-learning-book / Chapter13-Learning-to-Run
Chapter 13 Learning to Run in book Deep Reinforcement Learning: code example of solving NIPS 2017: Learning to Run challenge with paralleled Soft Actor-Critic (SAC) algorithm.
☆13Updated 3 years ago
Alternatives and similar repositories for Chapter13-Learning-to-Run:
Users that are interested in Chapter13-Learning-to-Run are comparing it to the libraries listed below
- Paper Collection for Batch RL with brief introductions.☆85Updated 3 years ago
- Click Me -->☆31Updated 2 years ago
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023☆63Updated 4 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Inverse Constrained Reinforcement Learning (ICML 2021)☆22Updated 3 years ago
- Benchmarked implementations of Offline RL Algorithms.☆72Updated 3 weeks ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆80Updated last year
- ☆16Updated 3 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆29Updated 3 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆117Updated 4 months ago
- ☆88Updated 2 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆85Updated 4 years ago
- Implementations of a large collection of reinforcement learning algorithms.☆27Updated last year
- soft q learning and soft actor critic☆15Updated 6 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆18Updated 11 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆73Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆46Updated 2 years ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆25Updated 3 months ago
- Repo for Implicit Diffusion Q-Learning☆104Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 9 months ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆30Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- ☆33Updated 7 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 3 weeks ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆14Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- ☆42Updated 3 years ago