chihkuanyeh / Automatic-Bridge-Bidding-by-Deep-Reinforcement-Learning
The released model of the paper 'Automatic Bridge Bidding by Deep Reinforcement Learning' in ECAI 2016
☆19Updated 7 years ago
Alternatives and similar repositories for Automatic-Bridge-Bidding-by-Deep-Reinforcement-Learning:
Users that are interested in Automatic-Bridge-Bidding-by-Deep-Reinforcement-Learning are comparing it to the libraries listed below
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Tensorflow implementation of Deep Deterministic Policy Gradients☆20Updated 7 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆74Updated 5 years ago
- ☆30Updated 6 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Double Deep Q-Learning with Prioritized Experience Replay☆35Updated 6 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 6 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated 2 years ago
- TensorFlow 2.0 for Deep Reinforcement Learning.☆84Updated last year
- Multiagent deep reinforcement learning research project☆27Updated 7 months ago
- Tensorflow Implementation for "Noisy network for exploration"☆32Updated 7 years ago
- Counterfactual Regret Minimization☆29Updated 6 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- ☆9Updated 5 years ago
- An implementation of Counterfactual Regret Minimization (CFR) via Temporal Difference (TD) learning☆22Updated 11 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆25Updated 6 years ago
- Yet another prioritized experience replay buffer implementation.☆49Updated 2 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆59Updated 6 years ago
- implement of prioritized experience replay☆158Updated 6 years ago
- OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning☆161Updated 5 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 5 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆70Updated 8 years ago
- Actor-critic with experience replay☆252Updated 2 years ago