yfletberliac / rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
☆91 · Aug 21, 2019 · Updated 6 years ago
Alternatives and similar repositories for rlss-2019
Users who are interested in rlss-2019 are comparing it to the repositories listed below.
- Reinforcement learning tutorials using the rlberry library. ☆17 · Jan 9, 2023 · Updated 3 years ago
- Study notes on the algorithm engineer's technology stack ☆15 · Aug 22, 2022 · Updated 3 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning ☆16 · Nov 7, 2018 · Updated 7 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- ☆27 · May 17, 2019 · Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 · Nov 14, 2019 · Updated 6 years ago
- Curated materials for different machine learning related summer schools ☆19 · Mar 8, 2021 · Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 · May 15, 2019 · Updated 6 years ago
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- Implementation of multi-armed bandits in Julia ☆12 · Jan 12, 2020 · Updated 6 years ago
- An easy-to-use reinforcement learning library for research and education. ☆176 · Jan 19, 2026 · Updated 3 weeks ago
- ☆13 · May 30, 2019 · Updated 6 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards. ☆10 · Nov 13, 2017 · Updated 8 years ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game" ☆11 · Jul 6, 2024 · Updated last year
- Multi-agent reinforcement learning environment ☆38 · Jul 9, 2019 · Updated 6 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆10 · May 8, 2018 · Updated 7 years ago
- ☆14 · Jun 7, 2023 · Updated 2 years ago
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models" ☆13 · Dec 4, 2023 · Updated 2 years ago
- MuJoCo Models for Personal Robot 2 (PR2) ☆11 · Aug 25, 2018 · Updated 7 years ago
- Reinforcement Learning from Hierarchical Critics ☆13 · Jul 30, 2020 · Updated 5 years ago
- A simple Gridworld environment for Open AI gym ☆25 · Jun 10, 2018 · Updated 7 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Jun 15, 2023 · Updated 2 years ago
- MuJoCo models for Unitree Robots ☆12 · Nov 24, 2021 · Updated 4 years ago
- Lecture notes for a course on Decision and Game Theory for undergraduates studying AI ☆13 · Dec 14, 2018 · Updated 7 years ago
- Attempt to create a boilerplate Python package structure with up-to-date tools and workflows ☆15 · Dec 17, 2022 · Updated 3 years ago
- A pack of control system algorithms implemented in C to be used in embedded systems. ☆15 · Dec 7, 2024 · Updated last year
- ☆28 · Dec 29, 2025 · Updated last month
- research and implementations of Deep RL agents and their applications ☆59 · Aug 31, 2025 · Updated 5 months ago
- Extending rllab to event-driven multiagent environments ☆13 · Oct 1, 2018 · Updated 7 years ago
- GAIL learning to imitate PPO playing CartPole. ☆12 · May 27, 2021 · Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 6 years ago
- ☆16 · Sep 14, 2020 · Updated 5 years ago
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/ ☆16 · Sep 21, 2021 · Updated 4 years ago
- Implementation of the POIS algorithm ☆15 · Apr 9, 2019 · Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 · Nov 14, 2018 · Updated 7 years ago
- My notes on reinforcement learning papers ☆15 · Jun 14, 2018 · Updated 7 years ago
- ☆15 · Jul 6, 2023 · Updated 2 years ago
- Python module with simulation backends ☆17 · Oct 22, 2020 · Updated 5 years ago
- Bandits Environments for the OpenAI Gym ☆89 · Jan 15, 2020 · Updated 6 years ago