johannbrehmer / rl-6-nimmtLinks
Solving the card game 6 nimmt! with reinforcement learning
β13Updated 4 years ago
Alternatives and similar repositories for rl-6-nimmt
Users that are interested in rl-6-nimmt are comparing it to the libraries listed below
Sorting:
- π© A simple and clean python banner generator - Bannersβ15Updated 3 years ago
- Reinforcement learning in pure JAX.β13Updated 2 weeks ago
- Understanding RL vision Distill articleβ25Updated 2 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learningβ12Updated 5 years ago
- Collection of Deep Reinforcement Learning Jupyter Notebooks. Each notebook is self-contained and presents single algorithm. These includeβ¦β38Updated 5 years ago
- Open AI Gym environment of the Missile Command Atari game.β14Updated 2 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.β15Updated 5 years ago
- Applying DeepMind's MuZero algorithm to the cart pole environment in gymβ22Updated 2 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)β87Updated 10 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)β17Updated 3 years ago
- Notes for papers or blog posts about ML, Robotics, CV.β15Updated 7 years ago
- Customizable RecSys Simulator for OpenAI Gymβ26Updated 4 years ago
- The Path to Nash Equilibriumβ38Updated 3 years ago
- A simple, continuous-control environment for OpenAI Gymβ23Updated 3 years ago
- β31Updated 3 years ago
- A GPU-accelerated fork of stable-baselines. Delivering reliable implementations of reinforcement learning algorithms.β25Updated 4 years ago
- Collection of game-theoretic algorithms for Pokerβ29Updated 6 years ago
- Implementation of Proximal Policy Optimization in Jax+Flaxβ21Updated 2 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).β15Updated 4 years ago
- Contextual Bandits Action Elimination DQNβ21Updated 7 years ago
- Source code for the AAAI 2019 paper "On-Line Learning of Linear Dynamical Systems: Exponential Forgetting in Kalman Filters" (https://arxβ¦β19Updated 4 years ago
- Reinforcement Learning Assemblyβ92Updated 4 years ago
- Simple single file implementations of Reinforcement Learning algorithms in Juliaβ23Updated 10 months ago
- Open-source code for paper CDT: Cascading Decision Trees for Explainable Reinforcement Learningβ38Updated 2 months ago
- OpenAI Gym Environment for ROS.β12Updated 8 years ago
- Made for a reading group at the Center for Safe AGI.β12Updated 3 years ago
- β37Updated 2 years ago
- Using self-play, MCTS, and a deep neural network to create a hearthstone ai playerβ29Updated 7 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree searchβ107Updated 6 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QTβ¦β16Updated 5 years ago