Howuhh / sac-n-jaxLinks
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆51Updated 2 years ago
Alternatives and similar repositories for sac-n-jax
Users that are interested in sac-n-jax are comparing it to the libraries listed below
Sorting:
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆73Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆53Updated last month
- ☆19Updated last month
- ☆47Updated 2 years ago
- POPGym Library in JAX☆11Updated last year
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆40Updated last year
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Updated 2 years ago
- ☆42Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 7 months ago
- Baselines for gymnax 🤖☆67Updated 2 years ago
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022☆28Updated 2 years ago
- A collection of matrix games in JAX☆11Updated 6 months ago
- A collection of RL algorithms written in JAX.☆98Updated 2 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆55Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆25Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆24Updated 2 months ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆51Updated 3 weeks ago
- Conservative Q learning in Jax☆54Updated 2 years ago
- Corax: Core RL in JAX☆38Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆100Updated 7 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- ☆46Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year