jetnew / SlimeRL
Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", won 1st Prize at 17th STePS.
☆16Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for SlimeRL
- Understanding RL vision Distill article☆23Updated last year
- ☆16Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- ☆17Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆42Updated 4 years ago
- ☆21Updated 4 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆24Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆21Updated 3 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆12Updated 3 years ago
- OpenAi's gym environment wrapper to vectorize them with Ray☆22Updated last year
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆25Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆18Updated 2 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Updated 4 years ago
- Implicit Distributional Actor Critic☆10Updated 2 years ago
- A squad movement planning library for StarCraft AI using Monte Carlo Tree Search and Negamax☆14Updated 5 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆23Updated 4 months ago
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 3 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆19Updated last year
- High-quality reference implementations of various algorithms for Inverse Reinforcement Learning☆13Updated 6 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆19Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆32Updated 4 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆59Updated last month