chihkuanyeh / Automatic-Bridge-Bidding-by-Deep-Reinforcement-Learning
The released model of the paper 'Automatic Bridge Bidding by Deep Reinforcement Learning' in ECAI 2016
☆19Updated 7 years ago
Related projects:
- Double Deep Q-Learning with Prioritized Experience Replay☆34Updated 6 years ago
- Multiagent deep reinforcement learning research project☆26Updated 3 months ago
- RainBow, Tensorflow☆49Updated 6 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆60Updated 3 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆11Updated 6 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆33Updated 7 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- Scalable Implementation of Neural Fictitious Self-Play☆71Updated 5 years ago
- A Multi-agent Learning Framework☆61Updated 3 years ago
- TensorFlow 2.0 for Deep Reinforcement Learning.☆82Updated last year
- Simple implementation of the regret matching algorithm for computing the RPS Nash equilibrium via self-play☆24Updated 5 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…☆15Updated 7 years ago
- Some multi-agent environments from 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆129Updated last year
- Deep Reinforcement Learning for Nash Equilibria☆39Updated last year
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆11Updated 3 years ago
- ☆10Updated 4 years ago
- ☆50Updated this week
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- Yet another prioritized experience replay buffer implementation.☆47Updated last year
- Efficient Exploration through Bayesian Deep Q-Networks☆35Updated 6 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆23Updated 4 months ago
- Advantage actor-critic reinforcement learning for OpenAI Gym CartPole☆64Updated 7 years ago
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend☆88Updated 4 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- FEN Code☆36Updated 4 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆62Updated 6 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆35Updated last year