akshaykhadse / reinforcement-learning
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
☆17Updated 6 years ago
Alternatives and similar repositories for reinforcement-learning:
Users that are interested in reinforcement-learning are comparing it to the libraries listed below
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 5 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- ☆54Updated 7 years ago
- ☆36Updated 8 years ago
- ☆11Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 5 years ago
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization☆69Updated 5 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆43Updated 6 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 5 years ago
- research and implementations of Deep RL agents and their applications☆49Updated 3 weeks ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆26Updated 3 years ago
- Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations (ICLR 2020)☆25Updated 3 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆21Updated 6 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- ☆27Updated 5 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Safe exploration in Markov Decision Processes☆37Updated 7 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- Sample-Efficient Automated Deep Reinforcement Learning☆34Updated 4 years ago
- Code implementation of: "Graying the black box: Understanding DQNs"☆20Updated 8 years ago
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP.☆28Updated last year
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 4 years ago
- ☆19Updated 4 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning☆26Updated 7 years ago
- Great resources for learning optimal control☆17Updated 5 years ago