NVlabs / gbrl_sb3
GBRL-based Actor-Critic algorithms implemented in stable-baselines3
☆34Updated 2 weeks ago
Alternatives and similar repositories for gbrl_sb3:
Users that are interested in gbrl_sb3 are comparing it to the libraries listed below
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆55Updated 2 months ago
- Repo to reproduce the First-Explore paper results☆37Updated 3 months ago
- PyTorch Package For Quasimetric Learning☆41Updated 5 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆77Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆32Updated 5 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 7 months ago
- ☆20Updated 9 months ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Updated last year
- ☆28Updated 2 years ago
- Learn online intrinsic rewards from LLM feedback☆35Updated 4 months ago
- GPT implementation in Flax☆18Updated 3 years ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- ☆41Updated 9 months ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆16Updated 10 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated 9 months ago
- Conformal Decision Theory code☆22Updated last year
- Gym environment for playing Wordle with RL agents☆39Updated 3 years ago
- A tool for recording RL trajectories.☆101Updated 5 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 2 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second☆83Updated last year
- ☆19Updated 6 months ago
- Generalised UDRL☆37Updated 2 years ago
- ☆76Updated 3 weeks ago
- Fully differentiable RL environments, written in Ivy.☆64Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Official repository for the paper "Automating Continual Learning"☆13Updated last year
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆49Updated 2 years ago