anki08 / Option-Critic
A simple option critic framework using Q-Learning
☆11Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Option-Critic
- ☆24Updated 3 years ago
- Bayesian Inverse Reinforcement Learning with simple environments☆20Updated 2 years ago
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- Implementation of the Option-Critic Architecture☆36Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆50Updated 3 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆31Updated last year
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆36Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆66Updated 11 months ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆31Updated last year
- PyTorch implementation of Episodic Meta Reinforcement Learning on variants of the "Two-Step" task. Reproduces the results found in three …☆31Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆34Updated 2 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆10Updated 3 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆17Updated 3 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- 🧊 🚩Comparison of active inference, q-learning and bayesian rl using modified FrozenLake environment☆35Updated 4 years ago
- Reinforcement Learning through Active Inference☆67Updated 4 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆18Updated 2 years ago
- ☆18Updated last year
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆18Updated 2 years ago
- ☆21Updated 6 months ago
- ☆16Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- ☆33Updated 2 months ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆36Updated 2 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆41Updated last year
- Implementation of GAIL and AIRL using chinerrl☆16Updated 2 years ago
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization☆67Updated 4 years ago
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Updated 3 years ago