ArmaanSethi / Hindsight-Experience-Replay-and-Hierarchical-Reinforcement-Learning
Comp 781 Project
☆8Updated 5 years ago
Related projects: ⓘ
- Autoregressive policies for continuous control reinforcement learning☆28Updated 5 years ago
- ☆15Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54Updated 5 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆32Updated 2 years ago
- ☆54Updated 6 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆10Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- Code for the Reset-free Trial and Error learning paper (RTE) experiments☆10Updated 6 years ago
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…☆12Updated 6 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆25Updated 5 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Updated 5 years ago
- ☆26Updated 5 years ago
- ☆30Updated 10 months ago
- Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"☆15Updated 5 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆21Updated 5 months ago
- ☆20Updated 5 months ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆44Updated last year
- ☆27Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 4 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Updated 3 years ago
- Scalable MCTS for team scenarios☆15Updated 3 months ago
- The implementation of Discriminator Soft Actor Critic☆13Updated 4 years ago
- Dead-ends and Secure Exploration in Reinforcement Learning☆11Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆28Updated 5 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…☆28Updated 5 years ago
- Reward Propagation using Graph Convolutional Networks☆13Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations (ICLR 2020)☆25Updated 2 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆24Updated 3 years ago