rlberry-py / tutorials
Reinforcement learning tutorials using the rlberry library.
☆16Updated last year
Related projects: ⓘ
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆77Updated 5 years ago
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆47Updated 2 years ago
- Reduce multiple PyTorch TensorBoard runs to new event (or CSV) files.☆68Updated 2 months ago
- Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).☆87Updated 5 years ago
- ☆10Updated this week
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆44Updated 11 months ago
- Revisiting Rainbow☆73Updated 3 years ago
- ☆65Updated 6 months ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- An implementation of MuZero in JAX.☆52Updated last year
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- Accelerated replay buffers in JAX☆39Updated 2 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆93Updated 4 years ago
- ☆11Updated 4 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆24Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆39Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆41Updated 2 months ago
- Performant, differentiable reinforcement learning☆25Updated last year
- ☆47Updated 3 years ago
- krazy grid world☆25Updated 4 years ago
- ☆21Updated 2 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆27Updated last month
- The Differentiable Cross-Entropy Method☆122Updated 4 years ago
- ☆89Updated 2 months ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆22Updated 2 years ago