RobertTLange / flexible-learning-group
A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗
☆27Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for flexible-learning-group
- A Tutorial on Deep Reinforcement Learning in PyTorch☆29Updated last year
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 4 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆58Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆27Updated 4 years ago
- Clockwork VAEs in JAX/Flax☆31Updated 3 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- ☆81Updated 3 years ago
- ☆80Updated last year
- Reinforcement learning library in JAX.☆103Updated last year
- Graph Nets in pytorch☆27Updated last year
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 2 months ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- Baselines for gymnax 🤖☆58Updated last year
- Vectorization techniques for fast population-based training.☆54Updated 2 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 5 years ago
- Bayesian Bandits☆66Updated last year
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Lightweight Cluster/Cloud VM Job Management 🚀☆41Updated 2 months ago
- Performant, differentiable reinforcement learning☆25Updated last year
- ☆28Updated 2 years ago
- A Towers of Hanoi environment in OpenAI Gym Style☆12Updated 5 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆15Updated 5 years ago
- Some small scale experiments for my blog posts 📝☆78Updated 2 years ago
- ☆27Updated 3 years ago
- Contains all materials for the paper "A counterfactual simulation model of causal judgment".☆22Updated 3 years ago
- ☆85Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆27Updated 2 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated last year