andrewk1 / pytorch-deep-bayesian-bandits
PyTorch port and extension of the Deep Bayesian Bandits Library
☆42 · Updated 5 years ago
Alternatives and similar repositories for pytorch-deep-bayesian-bandits:
Users who are interested in pytorch-deep-bayesian-bandits are comparing it to the libraries listed below:
- Code for "Neural causal learning from unknown interventions" ☆100 · Updated 4 years ago
- Code accompanying the OptionGAN paper. ☆43 · Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI. ☆79 · Updated last year
- Summaries and minimal implementations of ML / statistics research articles. ☆39 · Updated 3 years ago
- Simple tools for statistical analyses in RL experiments ☆66 · Updated 6 years ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019) ☆42 · Updated 4 years ago
- Invariant Causal Prediction for Block MDPs ☆44 · Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 · Updated last year
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders. ☆43 · Updated 7 years ago
- GitHub page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders" ☆14 · Updated 4 years ago
- Automatically Composing Representation Transformations as a Means for Generalization ☆24 · Updated 5 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning ☆55 · Updated 5 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading". ☆38 · Updated 3 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17 · Updated 6 years ago
- Implementation of iterative inference in deep latent variable models ☆43 · Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Updated 5 years ago
- Code for "Deep Convolutional Networks as shallow Gaussian Processes" ☆39 · Updated 5 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆50 · Updated 5 years ago
- Train a simple convnet on the MNIST dataset and evaluate the BALD acquisition function ☆16 · Updated 7 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm ☆25 · Updated 2 years ago
- On the pitfalls of measuring emergent communication ☆34 · Updated 5 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆50 · Updated 4 years ago
- Unified notation for Markov Decision Processes PO(MDP)s ☆24 · Updated 6 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned" ☆37 · Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 · Updated 6 years ago