Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
☆24Nov 4, 2024Updated last year
Alternatives and similar repositories for iql-pytorch
Users that are interested in iql-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A PyTorch implementation of Implicit Q-Learning☆97Oct 23, 2021Updated 4 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- ☆12Apr 25, 2022Updated 3 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- ☆11Jun 5, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Aug 8, 2022Updated 3 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆94Dec 1, 2024Updated last year
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- Repo for Implicit Diffusion Q-Learning☆123Dec 5, 2023Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆71Jul 17, 2025Updated 8 months ago
- ☆80Dec 9, 2022Updated 3 years ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆46Jul 27, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.☆23May 11, 2023Updated 2 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆11Feb 9, 2023Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Apr 6, 2023Updated 2 years ago
- ☆44Sep 19, 2021Updated 4 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆13Mar 7, 2022Updated 4 years ago
- Single-Life Reinforcement Learning☆14Dec 17, 2022Updated 3 years ago
- ☆28Dec 16, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆18Mar 18, 2026Updated last week
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14May 23, 2021Updated 4 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆400Dec 18, 2021Updated 4 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Cat Detection and Breed Recognition☆16Oct 27, 2018Updated 7 years ago
- PALMER: Perception-Action Loop with Memory for Long-Horizon Planning, NeurIPS 2022☆15Dec 12, 2022Updated 3 years ago
- ☆13Jun 3, 2022Updated 3 years ago
- BASALT Benchmark datasets, evaluation code and agent training example.☆22Nov 29, 2023Updated 2 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Jun 13, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆322Jan 23, 2022Updated 4 years ago
- 一个基于scrapy+selenium+phantomjs的爬虫程序,用于抓取多个学校的学术报告信息☆10Sep 3, 2015Updated 10 years ago
- The official codebase for running the experiments described in the AVDC paper.☆20Oct 2, 2024Updated last year
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆22Aug 1, 2021Updated 4 years ago
- Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models☆12Dec 5, 2024Updated last year
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning" (NeurIPS 2022 Oral)☆34Jan 23, 2023Updated 3 years ago