MetcalfeTom / stable-baselines3-GPU
A GPU-accelerated fork of stable-baselines. Delivering reliable implementations of reinforcement learning algorithms.
☆23Updated 4 years ago
Alternatives and similar repositories for stable-baselines3-GPU:
Users that are interested in stable-baselines3-GPU are comparing it to the libraries listed below
- Deep Reinforcement Learning Framework done with PyTorch☆34Updated this week
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆35Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- ☆20Updated 8 months ago
- Flax Implementation of DreamerV3 on Crafter☆14Updated last week
- ☆31Updated 11 months ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 4 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆93Updated 4 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆53Updated 11 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆110Updated 3 years ago
- ☆35Updated 2 years ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆21Updated 3 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Scalable Opponent Shaping Experiments in JAX☆24Updated 11 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆67Updated 9 months ago
- Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation☆19Updated 2 years ago
- ☆74Updated 6 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆64Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- ☆47Updated 2 years ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆35Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations☆73Updated last year
- a modular reinforcement learning library with JAX agents☆22Updated last week
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 9 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆37Updated last year
- Fast reinforcement learning research☆57Updated 3 months ago
- ☆73Updated 4 months ago
- A2C is a special case of PPO!☆19Updated 2 years ago