Evolution-based Soft Actor-Critic (ESAC)
☆42Jul 25, 2024Updated last year
Alternatives and similar repositories for esac
Users that are interested in esac are comparing it to the libraries listed below.
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Oct 20, 2023Updated 2 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions☆32Apr 7, 2021Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi…☆249Sep 13, 2020Updated 5 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Oct 14, 2020Updated 5 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆22Mar 11, 2022Updated 4 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆56Jul 25, 2024Updated last year
- ☆19Feb 18, 2022Updated 4 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Code for the IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization☆69Apr 21, 2020Updated 5 years ago
- ☆73May 24, 2019Updated 6 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 8 years ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆28Jun 8, 2020Updated 5 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- ☆23Jul 15, 2021Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"☆20Oct 2, 2022Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- A Python implementation of COMO-CMA-ES, a non-elitist multiobjective Evolution Strategy☆16Jan 24, 2026Updated last month
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies☆29May 30, 2022Updated 3 years ago
- ☆71Jan 3, 2023Updated 3 years ago
- [ICLR'20] Learning to Learn by Zeroth-Order Oracle☆14Feb 7, 2020Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Dec 17, 2019Updated 6 years ago
- Actor Prioritized Experience Replay☆18Nov 20, 2023Updated 2 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- This is the official implementation of ERL-Re2.☆72Jun 18, 2024Updated last year
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Implementation of Deep Q-Network (DQN) and Model Predictive Control, and their evaluation on the Quanser robot platform☆15Jul 24, 2020Updated 5 years ago