deep-reinforcement-learning-book / Chapter13-Learning-to-RunLinks
Chapter 13 Learning to Run in book Deep Reinforcement Learning: code example of solving NIPS 2017: Learning to Run challenge with paralleled Soft Actor-Critic (SAC) algorithm.
☆13Updated 3 years ago
Alternatives and similar repositories for Chapter13-Learning-to-Run
Users that are interested in Chapter13-Learning-to-Run are comparing it to the libraries listed below
Sorting:
- ☆31Updated 2 years ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆123Updated 7 months ago
- Inverse Constrained Reinforcement Learning (ICML 2021)☆23Updated 3 years ago
- rl-papers☆47Updated 2 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 2 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- Benchmarked implementations of Offline RL Algorithms.☆73Updated 3 months ago
- ☆89Updated 2 years ago
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.☆33Updated 5 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆30Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- Click Me -->☆32Updated 2 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆82Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆19Updated last year
- CS285 Homework☆26Updated 4 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆58Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆170Updated 3 years ago
- ☆99Updated 4 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆34Updated 2 years ago
- ☆24Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 11 months ago
- A collection of offline reinforcement learning algorithms.☆189Updated 7 months ago
- ☆48Updated 2 years ago
- ☆49Updated this week
- ☆41Updated 3 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆30Updated 3 years ago
- ☆21Updated 10 months ago