hilkoc / AI_ArenaLinks
The purpose of this project is to research Artificial Intelligence and Reinforcement Learning. In the AI Arena, multiple agents can interact with a single environment. After sending its action, each each agent will receive a reward. This allows agents to learn, improve their behavior and to adapt to each other. Interesting phenomena can arise..…
☆35Updated 8 years ago
Alternatives and similar repositories for AI_Arena
Users that are interested in AI_Arena are comparing it to the libraries listed below
Sorting:
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆101Updated 5 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆138Updated last year
- Multi Agent Reinforcement Learning using MalmÖ☆264Updated 5 years ago
- ☆107Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆138Updated 7 years ago
- Startcraft II Machine Learning research with DeepMind pysc2 python library .mini-games and agents.☆134Updated 6 years ago
- ☆303Updated 2 years ago
- A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface☆386Updated 2 years ago
- Half Field Offense in Robocup 2D Soccer☆237Updated 3 years ago
- A reinforcement learning framework☆157Updated 6 years ago
- A customizable framework to create maze and gridworld environments☆268Updated 6 years ago
- Actor-critic with experience replay☆256Updated 3 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆206Updated 5 years ago
- An environment of the board game Go using OpenAI's Gym API☆176Updated 3 years ago
- Repo for reproduction of sequential social dilemmas☆407Updated 9 months ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆278Updated 5 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆259Updated last year
- C51-DDQN in Keras☆126Updated 8 years ago
- Gridworld environments for OpenAI gym.☆79Updated last year
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 6 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- PySC2 OpenAI Gym Environments☆48Updated 6 years ago
- ☆69Updated 7 years ago
- Implementing reinforcement-learning algorithms for pysc2 -environment☆89Updated 7 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆206Updated 3 years ago
- Gym - Doom environments based on VizDoom.☆104Updated 8 years ago
- Code for the paper "Evolved Policy Gradients"☆253Updated 7 years ago
- ☆111Updated 3 years ago