A simple PyTorch implementation of discrete soft Q-learning (SQL) and soft Q imitation learning (SQIL).
☆58 · Oct 18, 2022 · Updated 3 years ago
Alternatives and similar repositories for soft-Q-learning
Users who are interested in soft-Q-learning are comparing it to the repositories listed below.
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 · Jan 10, 2019 · Updated 7 years ago
- PyTorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation ☆68 · May 25, 2021 · Updated 4 years ago
- An implementation of GAIL in PyTorch ☆14 · Mar 11, 2020 · Updated 5 years ago
- ☆18 · Aug 14, 2023 · Updated 2 years ago
- Inverse reinforcement learning proof-of-concept using the Guided Cost/Reward Learning approach ☆10 · Mar 23, 2020 · Updated 5 years ago
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a. ⅂ꓤ) ☆13 · Oct 25, 2023 · Updated 2 years ago
- Papers about reinforcement learning ☆13 · Jan 4, 2021 · Updated 5 years ago
- TensorFlow implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets" ☆10 · Dec 8, 2022 · Updated 3 years ago
- Disagreement-Regularized Imitation Learning ☆30 · May 25, 2021 · Updated 4 years ago
- Multi-Agent Deep Deterministic Policy Gradient implementation in PyTorch ☆10 · Aug 2, 2020 · Updated 5 years ago
- Learning Domain-Independent Planning Heuristics over Hypergraphs (ICAPS'20) ☆14 · Mar 21, 2025 · Updated 11 months ago
- Count-based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 6 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆38 · Mar 1, 2021 · Updated 5 years ago
- Exploration by Random Network Distillation ☆15 · Dec 30, 2018 · Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 · Nov 14, 2018 · Updated 7 years ago
- Model-based reinforcement learning using CEM, MPC and PETS ☆16 · Nov 20, 2019 · Updated 6 years ago
- The implementation of Discriminator Soft Actor Critic ☆15 · Jan 25, 2020 · Updated 6 years ago
- A deterministic TensorFlow 2.0 (Keras) implementation of OpenAI's proximal policy optimization (PPO) actor-critic algorithm. ☆12 · Sep 3, 2020 · Updated 5 years ago
- Soft Q-learning and soft actor-critic ☆16 · Dec 23, 2018 · Updated 7 years ago
- Codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning" ☆104 · Dec 8, 2022 · Updated 3 years ago
- ☆15 · Nov 22, 2019 · Updated 6 years ago
- Reinforcement Learning -- Imitation Learning, Behavior Cloning, DAgger (Data Aggregation) ☆21 · Apr 15, 2018 · Updated 7 years ago
- Revisiting Rainbow ☆76 · Jun 9, 2021 · Updated 4 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆436 · Nov 28, 2023 · Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 · Oct 6, 2021 · Updated 4 years ago
- Code for Sibling Rivalry and experiments presented in associated paper ☆17 · May 1, 2025 · Updated 10 months ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018. ☆16 · Jun 5, 2019 · Updated 6 years ago
- Collection of reinforcement learning algorithms ☆16 · Sep 29, 2025 · Updated 5 months ago
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance ☆17 · May 1, 2020 · Updated 5 years ago
- Code for magnetic mirror descent. ☆17 · Oct 5, 2023 · Updated 2 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors" ☆46 · Nov 22, 2022 · Updated 3 years ago
- Implementation of the Option-Critic Architecture ☆41 · Dec 9, 2018 · Updated 7 years ago
- A collection of offline reinforcement learning algorithms. ☆208 · Nov 26, 2024 · Updated last year
- Curiosity-driven Exploration by Self-supervised Prediction ☆23 · Jun 13, 2019 · Updated 6 years ago
- Image-based gridworld experiment for learning Markov state abstractions ☆21 · Sep 16, 2024 · Updated last year
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL ☆24 · Nov 4, 2024 · Updated last year
- Advantage weighted Actor Critic for Offline RL ☆52 · Aug 27, 2022 · Updated 3 years ago
- ☆45 · Nov 29, 2021 · Updated 4 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp… ☆1,405 · Nov 29, 2023 · Updated 2 years ago