Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
☆90Aug 21, 2019Updated 6 years ago
Alternatives and similar repositories for rlss-2019
Users that are interested in rlss-2019 are comparing it to the libraries listed below.
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- ☆28Dec 29, 2025Updated 3 months ago
- An easy-to-use reinforcement learning library for research and education.☆176Mar 16, 2026Updated last week
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆13May 30, 2019Updated 6 years ago
- ☆14Jun 7, 2023Updated 2 years ago
- Reinforcement learning tutorials using the rlberry library.☆17Jan 9, 2023Updated 3 years ago
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/☆16Sep 21, 2021Updated 4 years ago
- ☆27May 17, 2019Updated 6 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Nov 14, 2019Updated 6 years ago
- Implementation of multi-armed bandits in Julia☆12Jan 12, 2020Updated 6 years ago
- Study notes on the algorithm engineer's technology stack☆15Aug 22, 2022Updated 3 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- Multi-agent reinforcement learning environment☆38Jul 9, 2019Updated 6 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- Attempt to create a boilerplate Python package structure with up-to-date tools and workflows☆15Dec 17, 2022Updated 3 years ago
- Curated materials for different machine learning related summer schools☆19Mar 8, 2021Updated 5 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning☆16Nov 7, 2018Updated 7 years ago
- ☆17Oct 25, 2016Updated 9 years ago
- ☆15Jul 6, 2023Updated 2 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Jan 22, 2020Updated 6 years ago
- Vaccination appointments on Doctolib☆13May 12, 2021Updated 4 years ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 7 years ago
- Simple gym environments for safety in Reinforcement Learning Research☆18Jul 17, 2024Updated last year
- Useful tools and practices for Python development☆18Jul 27, 2020Updated 5 years ago
- research and implementations of Deep RL agents and their applications☆59Aug 31, 2025Updated 6 months ago
- 🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…☆420Apr 30, 2024Updated last year
- ☆16Jun 30, 2019Updated 6 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- Reduced (lite) unofficial version of drake (https://drake.mit.edu/) that can be built with CMake.☆12Sep 6, 2020Updated 5 years ago
- ☆10May 10, 2024Updated last year