lossv / gym_contraLinks
A gym game for Contra that for reinforcement learning
☆10Updated 3 years ago
Alternatives and similar repositories for gym_contra
Users that are interested in gym_contra are comparing it to the libraries listed below
Sorting:
- 中国象棋gym环境☆14Updated 5 years ago
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆163Updated 4 years ago
- Reinforcement Learning attempts to beat Contra 3 for the SNES☆14Updated 6 years ago
- 以孤立语假设和宽度优先搜索为基础,构建了一种多通道堆叠注意力Transformer结构的斗地主ai☆94Updated 4 years ago
- The implementation of Discriminator Soft Actor Critic☆15Updated 5 years ago
- Random Network Distillation(RND) algo in Pytorch☆50Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆80Updated 6 years ago
- (TG'2021) Code for paper "Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning". TG = Transact…☆10Updated 2 years ago
- dqn autoplay mario bros☆21Updated 8 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU☆37Updated 7 years ago
- Example Code of Calling Python from C++ with PyBind11.☆64Updated 4 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆104Updated 6 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆61Updated 3 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 7 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Updated 7 years ago
- Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite☆25Updated 6 years ago
- This project is implementation code of AlphaStar☆204Updated last year
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 2 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Updated 7 years ago
- It's the pytorch implementation of google research football.☆43Updated 6 years ago
- Official gym API for game FightingICE.☆12Updated 6 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Updated 8 years ago
- Atari-DRQN (keras ver.)☆33Updated 7 years ago
- C++ implementation of Proximal Policy Optimization☆87Updated 3 years ago
- This is the code for "OpenAI Five vs DOTA 2 Explained" By Siraj Raval on Youtube☆167Updated 7 years ago
- An AI program that plays Flappy Bird using reinforcement learning.☆43Updated 4 years ago
- 用于保存自己的机器学习项目☆16Updated 4 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 7 years ago