google-research / evoflow
☆32Updated 4 years ago
Related projects: ⓘ
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 2 weeks ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆77Updated 11 months ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- ☆80Updated 11 months ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- Distributed implementation of popular evolutionary methods☆63Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆15Updated 5 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆57Updated 4 years ago
- ☆19Updated 5 years ago
- Code for NeurIPS 2019 paper: "Symmetry-Based Disentangled Representation Learning requires Interaction with Environments" by H. Caselles-…☆34Updated 4 years ago
- ☆45Updated last month
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Library for learning and inference with Sum-product Networks utilizing TensorFlow 2.x and Keras☆47Updated 3 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated last year
- ☆14Updated 5 years ago
- Autoregressive Energy Machines☆77Updated last year
- Statistical adaptive stochastic optimization methods☆32Updated 4 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆93Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Starter kit for the black box optimization challenge at Neurips 2020☆113Updated 4 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆30Updated 5 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆77Updated 4 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated last year
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆26Updated 4 years ago
- ProBO: Versatile Bayesian Optimization Using Any Probabilistic Programming Language☆16Updated 5 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆47Updated last year
- Reinforcement Learning Assembly☆92Updated 3 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- Separating value functions across time-scales.☆17Updated 5 years ago