MehdiZouitine / gym_ma_toy
Toy environment set for multi-agent reinforcement learning and more
☆38Updated this week
Related projects ⓘ
Alternatives and complementary repositories for gym_ma_toy
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- Robust Reinforcement Learning Suite☆20Updated 5 months ago
- Tidy up your machine learning experiments☆17Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 4 years ago
- Reinforcement learning algorithms in RLlib☆56Updated 6 months ago
- The source code for the gym-microrts paper.☆42Updated 2 years ago
- 🤖 Creation of an RL environment with Unity, where an agent must learn to survive by moving 🦿 and shooting🔫, using ML-Agents !☆17Updated 3 years ago
- ☆85Updated 3 months ago
- ☆22Updated 4 years ago
- PyTorch code to train and evaluate Procgen tasks☆23Updated 4 years ago
- PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)☆34Updated 2 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆20Updated last year
- Hierarchical Self-Play☆21Updated 5 years ago
- 🧩 Create your own puzzle, use my agents to solve it 🤖 try them out! 🧩☆9Updated 2 years ago
- An easy-to-use reinforcement learning library for research and education.☆161Updated 3 weeks ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30Updated 4 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆30Updated this week
- ☆71Updated 5 months ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- MultiTask Environments for Reinforcement Learning.☆74Updated 2 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- Baselines for gymnax 🤖☆60Updated last year
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Series of deep reinforcement learning algorithms 🤖☆29Updated 3 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- ✨🌲 Hierarchical extreme multiclass and multi-label classification.☆17Updated last year