sash-a / CleanRL.jl
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆20Updated last year
Related projects:
- A collection of matrix games in JAX☆9Updated 8 months ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆11Updated last year
- ☆17Updated 3 months ago
- An Open-Ended Agentic Simulator☆17Updated last month
- Modular Single-file Reinforcement Learning Algorithms Library☆37Updated last year
- ☆33Updated last year
- Accelerated replay buffers in JAX☆39Updated 2 years ago
- ☆56Updated 3 weeks ago
- Scalable Opponent Shaping Experiments in JAX☆19Updated 5 months ago
- Reinforcement Learning inside a 3D soccer simulation☆19Updated this week
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- A modular implementation of PPO, and soon hopefully other algorithms.☆26Updated 8 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆10Updated 2 months ago
- OpenAI's gym environment wrapper to vectorize them with Ray☆22Updated last year
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆22Updated 2 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆53Updated 3 months ago
- A high-performance reinforcement learning library in jax specialized for robotic learning☆21Updated last year
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆46Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆13Updated last year
- Model-based reinforcement learning in TensorFlow☆53Updated 3 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆19Updated last year
- Corax: Core RL in JAX☆30Updated 6 months ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆21Updated 5 months ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆46Updated last year
- ☆28Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆49Updated 11 months ago
- ☆11Updated 2 months ago
- Baselines for gymnax 🤖☆57Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning☆54Updated last year