aijunbai / taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
☆32 · Oct 23, 2017 · Updated 8 years ago
Alternatives and similar repositories for taxi
Users who are interested in taxi are comparing it to the libraries listed below.
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018) ☆21 · Nov 3, 2019 · Updated 6 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations ☆18 · Sep 17, 2019 · Updated 6 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning. ☆25 · May 15, 2019 · Updated 6 years ago
- ☆10 · Aug 17, 2022 · Updated 3 years ago
- Monte Carlo value iteration for continuous-state POMDPs ☆12 · Sep 3, 2013 · Updated 12 years ago
- Dynamic Simulation Environments for Reinforcement Learning ☆13 · Apr 17, 2021 · Updated 4 years ago
- Scalable MCTS for team scenarios ☆16 · Jun 14, 2024 · Updated last year
- Hierarchical Q-learning implementation ☆11 · Jun 9, 2015 · Updated 10 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning ☆35 · May 14, 2019 · Updated 6 years ago
- ☆14 · Mar 26, 2019 · Updated 6 years ago
- rlcourse-march-17-hugobb created by GitHub Classroom ☆16 · Jul 3, 2024 · Updated last year
- A small tool to parse a POMDP and load into Python objects. ☆37 · Jan 6, 2020 · Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space… ☆24 · Nov 29, 2018 · Updated 7 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning ☆19 · Jul 17, 2018 · Updated 7 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic … ☆17 · Jan 22, 2019 · Updated 7 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆74 · Jun 1, 2017 · Updated 8 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs ☆15 · Jun 20, 2016 · Updated 9 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆28 · Dec 7, 2021 · Updated 4 years ago
- ☆21 · Aug 14, 2017 · Updated 8 years ago
- A library to benchmark reinforcement learning algorithms ☆21 · Apr 18, 2018 · Updated 7 years ago
- Implementation of Data Efficient Reinforcement Learning in PyTorch ☆20 · Aug 6, 2019 · Updated 6 years ago
- Hierarchical Self-Play ☆21 · Dec 5, 2018 · Updated 7 years ago
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options ☆47 · Dec 10, 2021 · Updated 4 years ago
- Deep Learning Project ☆23 · Jan 18, 2020 · Updated 6 years ago
- This repository contains the code for our paper on Dynamic Mirror Descent based MPC for Model-Free RL ☆26 · Feb 1, 2022 · Updated 4 years ago
- Python RRT algorithm with visualization, kinodynamic constraints and other optimizations/extensions ☆21 · Mar 24, 2014 · Updated 11 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 · Feb 3, 2022 · Updated 4 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning ☆94 · Apr 17, 2018 · Updated 7 years ago
- Codebase for Numerical Renaissance by Thomas Bewley ☆16 · Mar 11, 2024 · Updated last year
- ☆30 · Dec 2, 2021 · Updated 4 years ago
- ☆31 · Jul 1, 2019 · Updated 6 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines ☆50 · Jun 3, 2022 · Updated 3 years ago
- Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016) ☆35 · May 18, 2019 · Updated 6 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 ☆33 · Jul 22, 2021 · Updated 4 years ago
- Bayesian adaptive stimulus placement of psychometric function for MATLAB. ☆10 · Nov 7, 2018 · Updated 7 years ago
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient ☆10 · Feb 1, 2019 · Updated 7 years ago
- ☆14 · Aug 12, 2024 · Updated last year
- Fast, free, easy, and object-agnostic video anonymization ☆11 · Dec 12, 2020 · Updated 5 years ago
- A nonparametric variational information bottleneck (NVIB) layer in PyTorch ☆11 · Apr 15, 2025 · Updated 9 months ago