NVlabs / gbrl_sb3
GBRL-based Actor-Critic algorithms implemented in stable-baselines3
☆33Updated this week
Alternatives and similar repositories for gbrl_sb3:
Users that are interested in gbrl_sb3 are comparing it to the libraries listed below
- Repo to reproduce the First-Explore paper results☆37Updated 2 months ago
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- ☆31Updated 11 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated last month
- On-the-fly conversions between Jax and NumPy tensors☆49Updated 2 years ago
- Reinforcement Learning inside a 3D soccer simulation☆26Updated 6 months ago
- ☆20Updated 9 months ago
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆30Updated 4 months ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆14Updated 9 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated 8 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆23Updated 5 months ago
- Gradient-based constrained optimization for JAX☆29Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆55Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- ☆28Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆41Updated 4 months ago
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆49Updated last year
- Official Implementation of SFM and the baselines in Jax.☆15Updated 4 months ago
- Learn online intrinsic rewards from LLM feedback☆35Updated 3 months ago
- Generalised UDRL☆37Updated 2 years ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆24Updated 5 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆71Updated 7 months ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Updated last year
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago