jinPrelude / simple-es
Simple implementations of multi-agent evolutionary strategies using PyTorch.
☆15 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for simple-es
- Wrapper that vectorizes OpenAI Gym environments with Ray ☆22 · Updated last year
- Gym wrapper for pysc2 ☆10 · Updated 2 years ago
- Generalised UDRL ☆37 · Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 · Updated 2 years ago
- ☆41 · Updated last month
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆35 · Updated 2 weeks ago
- AGAC: Adversarially Guided Actor-Critic ☆47 · Updated 3 years ago
- ☆28 · Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆26 · Updated 2 years ago
- ☆20 · Updated 6 months ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning" ☆18 · Updated 2 years ago
- ☆28 · Updated 3 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax ☆16 · Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆12 · Updated 2 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 · Updated last year
- ☆21 · Updated 6 months ago
- ☆30 · Updated 3 months ago
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow ☆14 · Updated last week
- Accelerated replay buffers in JAX ☆39 · Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 · Updated 2 years ago
- Official PyTorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 · Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆22 · Updated 7 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆70 · Updated 11 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning ☆25 · Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆31 · Updated 4 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆21 · Updated 3 years ago
- A2C is a special case of PPO! ☆19 · Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 · Updated 3 years ago
- Scalable Opponent Shaping Experiments in JAX ☆21 · Updated 6 months ago
- ☆36 · Updated last year