brain-research / guided-evolutionary-strategies
Guided Evolutionary Strategies
☆267Updated last year
Alternatives and similar repositories for guided-evolutionary-strategies:
Users that are interested in guided-evolutionary-strategies are comparing it to the libraries listed below
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆373Updated 2 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆154Updated 6 years ago
- Velocity in deep-learning research☆276Updated 2 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆188Updated 2 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 4 years ago
- A reinforcement learning framework☆154Updated 6 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- Evolution Strategies in PyTorch☆352Updated 7 years ago
- Easy TensorFlow logging for quick prototypes☆110Updated 3 years ago
- Augmented environments with RL☆103Updated 5 years ago
- 2019 talk at GECCO☆68Updated 5 years ago
- ☆182Updated 5 months ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆152Updated 7 years ago
- Full World Models Implementation in Chainer☆165Updated 6 years ago
- ☆117Updated 4 years ago
- Code for paper "L4: Practical loss-based stepsize adaptation for deep learning"☆124Updated 5 years ago
- ☆160Updated 7 years ago
- Basic pytorch implementation of NAC/NALU from Neural Arithmetic Logic Units paper by trask et.al☆115Updated 6 years ago
- Code for the paper "Evolved Policy Gradients"☆248Updated 6 years ago
- NIPS 2017 Value Prediction Network☆166Updated 7 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- General Game Playing with Schema Networks☆41Updated 2 years ago
- safemutations☆144Updated 6 years ago
- Publicly releasable baselines for the Retro contest☆127Updated 6 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 7 years ago
- Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning☆206Updated 7 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago