IBM / sau-explore
Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for sau-explore
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- ☆44Updated 2 years ago
- Representation Learning in RL☆16Updated 2 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation☆37Updated 4 years ago
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- Contains code for the NeurIPS 2020 paper by Pan et al., "Continual Deep Learning by FunctionalRegularisation of Memorable Past"☆44Updated 4 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆29Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- ☆85Updated 3 months ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆10Updated 3 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated last year
- Re-implementation of Hamiltonian Generative Networks paper☆33Updated 2 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Updated last year
- This repository contains the Julia code for the paper "Competitive Gradient Descent"☆23Updated 4 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆56Updated 3 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆20Updated 5 years ago
- Limitations of the Empirical Fisher Approximation☆45Updated 4 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- Code to reproduce results on toy tasks and companion blog for the paper.☆20Updated 2 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆16Updated 2 years ago
- ☆13Updated last year
- ☆15Updated last year
- ☆13Updated 5 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆19Updated 4 years ago