RL experiments
☆69Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for rl
Users that are interested in rl are comparing it to the libraries listed below.
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆108Apr 15, 2019Updated 7 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- stdr robot (turtlebot, etc) simulation with ROS, maze solving, navigation, multiple tasks.☆13Mar 24, 2018Updated 8 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 5 years ago
- A web based GUI for pgmpy☆15Jan 7, 2015Updated 11 years ago
- Machine learning in nim☆12Aug 16, 2014Updated 11 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- The Common Lisp Vim IDE☆20Mar 6, 2010Updated 16 years ago
- hacking torch-like neural networks in Julia☆11Jan 29, 2015Updated 11 years ago
- Gym - Doom environments based on VizDoom.☆105Mar 17, 2017Updated 9 years ago
- RCrawler: An R package for parallel web crawling and scraping. To cite this software publication: http://www.sciencedirect.com/science/ar…☆13Nov 8, 2016Updated 9 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆55Dec 6, 2018Updated 7 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Evolution of Discrete data with Reinforcement Learning☆13Dec 8, 2019Updated 6 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- Turtlebot3 navigation and maze solving approach using ROS☆11Mar 24, 2018Updated 8 years ago
- Electron Microscopy Images, Neuron Segmentation Task. https://cremi.org☆17Mar 10, 2018Updated 8 years ago
- Javascript implementation of Fractran☆15Sep 14, 2017Updated 8 years ago
- Atari gauntlet for RL agents☆29Mar 18, 2017Updated 9 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- ⚠️ NOTICE: This starter kit was used for 2019 challenge and has been deprecated in favour of 2020 Flatland challenge's starter kit presen…☆20Jun 9, 2020Updated 5 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- UCT tree search + supervised learning for Atari games☆12Feb 14, 2017Updated 9 years ago
- Replication of Uber Neuroevolution paper☆46Apr 14, 2018Updated 8 years ago
- This is a repository of PyTorch implementations of DQN and its variants, based on the original paper.☆13Nov 18, 2019Updated 6 years ago
- f-GAN Tensorflow f-GAN: Training Generative Neural Samplers Using Variational Divergence Minimization☆12Sep 15, 2018Updated 7 years ago
- Dialogue corpus creation and evaluation scripts for the Ubuntu Dialogue Corpus.☆15Jun 9, 2023Updated 2 years ago
- PyTorch implementation of DQN☆13Sep 27, 2019Updated 6 years ago
- simple UCI chess engine written by self learner from scratch☆12Sep 16, 2020Updated 5 years ago
- ☆23Oct 7, 2018Updated 7 years ago
- ☆12Jun 5, 2016Updated 9 years ago
- collections of language style transfer papers☆10Jan 4, 2018Updated 8 years ago
- My Homepage☆10May 16, 2026Updated last week
- Statistics on the most cited papers in recent years at each conference☆13Oct 24, 2018Updated 7 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆11Jul 4, 2022Updated 3 years ago