philipjball / SAC_PyTorch
π§Ά Minimal PyTorch Soft Actor Critic (SAC) implementation
β36Updated 2 years ago
Related projects β
Alternatives and complementary repositories for SAC_PyTorch
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimationβ36Updated 3 weeks ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variablesβ70Updated last year
- Self-implemented code for Model-Based Meta-Reinforcement Learningβ17Updated 5 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)β23Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradientsβ32Updated 4 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"β28Updated 5 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICLβ¦β50Updated 3 years ago
- My Body Is A Cageβ38Updated 3 years ago
- β41Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.β55Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDPβ28Updated 3 years ago
- β14Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]β34Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimizationβ24Updated 4 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Explorationβ67Updated 3 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learningβ49Updated 3 years ago
- β97Updated last year
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020)β43Updated 2 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)β18Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.β31Updated last year
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]β48Updated 3 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)β33Updated 5 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstratβ¦β49Updated last year
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.β36Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimationβ25Updated last year
- PyTorch IMPALA implementationβ24Updated 5 years ago
- β90Updated 11 months ago
- β71Updated 5 months ago
- Disagreement-Regularized Imitation Learningβ30Updated 3 years ago
- β17Updated 3 years ago