harpribot / IRL-maxent
Experiments showing the effects of parameters on Maximum Entropy Inverse Reinforcement Learning using a grid world
☆15Nov 26, 2016Updated 9 years ago
Alternatives and similar repositories for IRL-maxent
Users that are interested in IRL-maxent are comparing it to the libraries listed below
- An implementation of popular Inverse Reinforcement Learning algorithms for various tasks.☆21Jul 26, 2017Updated 8 years ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning☆22Sep 23, 2022Updated 3 years ago
- ☆13Dec 13, 2024Updated last year
- Implementing the two pioneering IRL papers "Algorithms for Inverse Reinforcement Learning" (Ng & Russell 2000) and "Maximum Entropy Inve…☆32Jul 6, 2023Updated 2 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Jul 6, 2017Updated 8 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆33Mar 24, 2017Updated 8 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL☆664May 10, 2024Updated last year
- Inverse Reinforcement Learning, Inverse Optimal Control, Apprenticeship Learning, Imitation Learning review☆46Apr 27, 2021Updated 4 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- Visual Transition State Clustering☆13Jan 6, 2018Updated 8 years ago
- ☆12Mar 6, 2023Updated 2 years ago
- 2019 Fall - Game theory and Multi-agent RL term project☆11Dec 13, 2019Updated 6 years ago
- Popular questions on stackexchange network☆13Dec 13, 2017Updated 8 years ago
- A Python CLI game and library for Tic-tac-toe.☆10Apr 4, 2017Updated 8 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.com…☆18Nov 23, 2025Updated 2 months ago
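- An open-source large language model project for mathematics that aims to explore whether large models are capable of mathematical creativity, and their potential in frontier mathematical research.☆17May 16, 2025Updated 8 months ago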
- ☆20Dec 4, 2024Updated last year
- Python 2/3 compatible .npz CIFAR-10 dataset☆10Mar 1, 2017Updated 8 years ago
- Docker image for Tensorflow and Keras with CUDA support☆10Dec 1, 2016Updated 9 years ago
- MID (Mutual Information Dimension) for measuring statistical dependence between two random variables☆12Apr 21, 2013Updated 12 years ago
- Plotting data on the interactive maps, IPython Notebook friendly.☆10Jun 5, 2020Updated 5 years ago
- Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - TensorFlow Im…☆13Feb 2, 2019Updated 7 years ago
- ☆11Dec 1, 2017Updated 8 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Jul 15, 2022Updated 3 years ago
- clone from geda-project☆19Oct 24, 2025Updated 3 months ago
- Implementations of selected inverse reinforcement learning algorithms.☆1,064Oct 21, 2022Updated 3 years ago
- We implement AI for Hearthstone using open source Hearthstone simulator FirePlace: https://github.com/jleclanche/fireplace.☆11Jun 12, 2016Updated 9 years ago
- Simple code for running and visualizing replicator dynamics☆11Jan 31, 2024Updated 2 years ago
- Spearmint uses Gaussian Processes to automatically optimize hyperparameters. This is a fork of Spearmint for the deep learning community.…☆11Nov 30, 2016Updated 9 years ago
- Hippo7, modular vjing tool☆11Apr 17, 2020Updated 5 years ago
- A jekyll template for courses☆17Sep 1, 2014Updated 11 years ago
- This repository contains binaries for the multiple teacher approach to learning differential private ML models: https://arxiv.org/abs/161…☆10Nov 16, 2016Updated 9 years ago
- A step-by-step implementation of building an AI agent that plays a 3D shooting game☆19Jul 16, 2025Updated 6 months ago
- tensorflow_serving inception gRPC client☆12Feb 16, 2017Updated 8 years ago
- ☆12Nov 8, 2017Updated 8 years ago
- Code related to "Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis" (EACL 2017)☆11Feb 5, 2018Updated 8 years ago
- Emotion-preserving face swapping algorithms using deep generative models☆11Mar 11, 2020Updated 5 years ago