Evolution-based Soft Actor-Critic (ESAC)
☆42Jul 25, 2024Updated last year
Alternatives and similar repositories for esac
Users that are interested in esac are comparing it to the libraries listed below
Sorting:
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Oct 20, 2023Updated 2 years ago
- Implementation of OpenAI's Evolution Strategies in PyTorch.☆20Apr 22, 2020Updated 5 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions☆32Apr 7, 2021Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 7 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆56Jul 25, 2024Updated last year
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆22Mar 11, 2022Updated 3 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Cross-entropy method variants for optimization in Julia☆12Apr 29, 2021Updated 4 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 5 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- A Python implementation of COMO-CMA-ES, a non-elitist multiobjective Evolution Strategy☆16Jan 24, 2026Updated last month
- [ICLR'20] Learning to Learn by Zeroth-Order Oracle☆14Feb 7, 2020Updated 6 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆33Jul 30, 2024Updated last year
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization☆68Apr 21, 2020Updated 5 years ago
- Implementation of Deep Q-Network(DQN)and Model Predictive Control, and their evaluation on the Quanser robot platform☆15Jul 24, 2020Updated 5 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆19Jul 17, 2018Updated 7 years ago
- ☆19Feb 18, 2022Updated 4 years ago
- ☆73May 24, 2019Updated 6 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…☆57Oct 18, 2021Updated 4 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- A biologically inspired, hierarchical bipedal locomotion controller for robots, trained using deep reinforcement learning.☆25Feb 21, 2021Updated 5 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆295Feb 24, 2021Updated 5 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 2 years ago
- [NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …☆29Dec 30, 2021Updated 4 years ago