jetnew / SlimeRL
Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", won 1st Prize at 17th STePS.
☆16Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for SlimeRL
- Understanding RL vision Distill article☆23Updated last year
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- High-quality reference implementations of various algorithms for Inverse Reinforcement Learning☆13Updated 6 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- ☆16Updated 3 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆42Updated 4 years ago
- ☆28Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- A minimal implementation of Go-Explore without domain knowledge☆13Updated 3 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆22Updated 4 months ago
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 3 years ago
- Train agents on MiniGrid from human demonstrations using Inverse Reinforcement Learning☆14Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 3 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Updated 3 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆32Updated 4 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 3 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆24Updated 3 years ago
- ☆21Updated 4 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- Generalised UDRL☆37Updated 2 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- ☆17Updated 2 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated last year
- Made for a reading group at the Center for Safe AGI.☆11Updated 2 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆21Updated 7 months ago