jinPrelude / simple-es
Simple implementations of multi-agent evolutionary strategies using PyTorch.
☆16Updated 3 years ago
Alternatives and similar repositories for simple-es
Users who are interested in simple-es are comparing it to the libraries listed below.
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 2 months ago
- ☆28Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Generalised UDRL☆37Updated 3 years ago
- ☆32Updated 10 months ago
- ☆31Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 7 months ago
- AGAC: Adversarially Guided Actor-Critic☆49Updated 3 years ago
- Modular Single-file Reinforcement Learning Algorithms Library☆37Updated 2 years ago
- ☆44Updated 8 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Baselines for gymnax 🤖☆66Updated 2 years ago
- A2C is a special case of PPO!☆21Updated 3 years ago
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- ☆31Updated 4 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆16Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- ☆21Updated last year
- Gym wrapper for pysc2☆10Updated 2 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆15Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- General Modules for JAX☆66Updated last month
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆45Updated 3 years ago
- Revisiting Rainbow☆75Updated 3 years ago
- A high-performance reinforcement learning library in jax specialized for robotic learning☆22Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago