Simple C++ project that includes header only implementations of Monte Carlo Tree Search(MCTS), Temporal Difference Learning, Minimax, and other agents to evaluate their performance in various two player games. Also includes an implementation of AlphaZero for Connect4.
☆11Jan 29, 2026Updated last month
Alternatives and similar repositories for Reinforcement-Learning
Users that are interested in Reinforcement-Learning are comparing it to the libraries listed below
Sorting:
- PPDDL plan evalutation simulator☆15Dec 30, 2019Updated 6 years ago
- The simple C/C++ library for hexapod (Robot spider with 6 legs) on Arduino.☆13Dec 27, 2018Updated 7 years ago
- Hexapod Robot Control☆10May 8, 2023Updated 2 years ago
- This repository contains notes and assignment for the University of Pennsylvania/Coursera in the Robotics Specialization: Computational M…☆41Apr 16, 2016Updated 9 years ago
- Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'☆12May 24, 2018Updated 7 years ago
- Robot planning and learning research code repository from the RAIL Group at GMU☆10Jul 2, 2023Updated 2 years ago
- ☆11Aug 1, 2019Updated 6 years ago
- Monte Carlo value iteration for continuous-state POMDPs☆12Sep 3, 2013Updated 12 years ago
- Monte Carlo Tree Search (MCTS) ,realize using python☆12Mar 10, 2016Updated 9 years ago
- Comp 781 Project☆10Jan 2, 2026Updated last month
- Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models☆12Feb 17, 2020Updated 6 years ago
- ☆10Apr 22, 2013Updated 12 years ago
- PyTorch implementation of "Variational Autoencoders with Jointly Optimized Latent Dependency Structure" [ICLR 2019]☆13Jul 14, 2019Updated 6 years ago
- Documentation and ressources of Kraby, an open-source hexapod robot☆14Aug 24, 2020Updated 5 years ago
- Recursive Bayesian Networks☆11May 11, 2025Updated 9 months ago
- Supporting code for the paper "Predicting aptamer sequences that interact with target proteins using an Aptamer-Protein Interaction class…☆15Dec 31, 2021Updated 4 years ago
- implicit behaviour cloning toy 2d example☆14Oct 8, 2021Updated 4 years ago
- FailureSensorIQ, a dataset and benchmark to probe LLMs’ reasoning and comprehension of sensor–failure relationships in industrial systems…☆33Feb 20, 2026Updated last week
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- A fast C++ impementation of Monte Carlo Tree Search with abstract classes that a user of this library can extend in order to use it. To d…☆48Sep 8, 2021Updated 4 years ago
- Aquaplanning QUick Automated Planning.☆13Oct 13, 2020Updated 5 years ago
- Behavior planner fusing runtime verification on traffic rules with single- and multi-agent Monte Carlo Tree Search☆11Jun 15, 2021Updated 4 years ago
- Code for Towards Visual Ego-Motion Learning in Robots☆12May 30, 2017Updated 8 years ago
- A Unity WebGL project for a TicTacToe game, using Monte Carlo Tree Search (MCTS) for its AI decision making.☆12Mar 18, 2023Updated 2 years ago
- Solving POMDPs using exact and approximate methods☆14Aug 9, 2017Updated 8 years ago
- LTL2PDDL tool☆11Jul 7, 2017Updated 8 years ago
- this is a set of examples on how to use ogre as: mouse/keyboard/gamepad input, how to use cg and glsl shaders and how to create a sphere …☆20Mar 17, 2010Updated 15 years ago
- ☆13Jul 4, 2020Updated 5 years ago
- Python program to convert a Context Free Grammar to Chomsky Normal Form.☆10May 9, 2025Updated 9 months ago
- A catkin wrapper for GTSAM☆14May 29, 2023Updated 2 years ago
- Flight code for our high-speed quadrotor obstacle avoidance method, as described in this paper:☆51Jan 17, 2017Updated 9 years ago
- A quadruped running machine in webots with three different gaits: trotting, pacing, and bounding.☆14May 22, 2022Updated 3 years ago
- Tic Tac Toe with Alpha Zero method - My first work☆18Aug 23, 2018Updated 7 years ago
- Our reproduction of paper "Pfeiffer M et al. From Perception to Decision: A Data-driven Approach to End-to-end Motion Planning for Autono…☆12Jun 16, 2019Updated 6 years ago
- ☆13Jul 22, 2019Updated 6 years ago
- Monte Carlo Motion Planning: ISRR 2015 Code☆17Apr 4, 2016Updated 9 years ago
- ☆14Oct 29, 2018Updated 7 years ago
- Repository for the paper "Generative Adversarial Network to Learn Valid Distributions of Robot Configurations for Inverse Kinematics and …☆17Jul 24, 2022Updated 3 years ago
- Hybrid Deep MILP Planner☆14Sep 6, 2022Updated 3 years ago