Manchery/iql-pytorch

Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL

☆24

Alternatives and similar repositories for iql-pytorch

Users who are interested in iql-pytorch are comparing it to the repositories listed below.

DesikRengarajan / LOGO
[ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
☆28 · Feb 10, 2022 · Updated 4 years ago
stanford-iris-lab / batch-exploration
☆12 · Apr 25, 2022 · Updated 4 years ago
sparkmxy / my-offlinerl
☆26 · Jun 14, 2022 · Updated 4 years ago
yobibyte / amorpheus
My Body Is A Cage
☆41 · Apr 13, 2021 · Updated 5 years ago
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98 · Dec 1, 2024 · Updated last year
philippe-eecs / IDQL
Repo for Implicit Diffusion Q-Learning
☆126 · Dec 5, 2023 · Updated 2 years ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆75 · Jul 17, 2025 · Updated last year
twitter / diffusion-rl
☆80 · Dec 9, 2022 · Updated 3 years ago
vikashplus / unitree_sim
MuJoCo models for Unitree Robots
☆12 · Nov 24, 2021 · Updated 4 years ago
ryanxhr / IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46 · Jul 27, 2023 · Updated 3 years ago
ryanxhr / DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆37 · Jan 5, 2023 · Updated 3 years ago
Rooshy-yang / BeCL
BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.
☆23 · May 11, 2023 · Updated 3 years ago
hardcore-study-group / music-shorts
🥁 Open source short music streaming service
☆13 · Apr 3, 2022 · Updated 4 years ago
DanielTakeshi / softgym_tfn
Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)
☆12 · Feb 9, 2023 · Updated 3 years ago
debashriroy / video-over-dsa
HD video streaming over contested RF environment, leveraging the concept of Dynamic Spectrum Access
☆12 · Feb 18, 2024 · Updated 2 years ago
ryanxhr / POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆58 · Apr 6, 2023 · Updated 3 years ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410 · Dec 18, 2021 · Updated 4 years ago
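Charmve / BallPlate
Ball-and-plate control system / rolling-ball system / BallPlate: national second-prize entry for Problem B of the 2017 National Undergraduate Electronic Design Contest
☆15 · May 27, 2024 · Updated 2 years ago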
kpertsch / star
Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022
☆15 · Dec 15, 2022 · Updated 3 years ago
miyosuda / evolution_and_ai
☆12 · Jul 6, 2023 · Updated 3 years ago
rmrafailov / kitchen
☆13 · Mar 7, 2022 · Updated 4 years ago
joenghl / HYPO
☆14 · Dec 29, 2023 · Updated 2 years ago
etaoxing / kitchen-shift
KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts
☆20 · Jun 21, 2022 · Updated 4 years ago
LunjunZhang / world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
☆72 · Jul 17, 2021 · Updated 5 years ago
brentyi / transformer-exercises-jax
☆18 · Apr 17, 2026 · Updated 3 months ago
anniesch / single-life-rl
Single-Life Reinforcement Learning
☆14 · Dec 17, 2022 · Updated 3 years ago
arjunbhorkar / ReViND
☆28 · Dec 16, 2022 · Updated 3 years ago
NOHYC / autonomous_driving_car_project
☆12 · Dec 5, 2021 · Updated 4 years ago
Mehooz / BIRD_code
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14 · May 23, 2021 · Updated 5 years ago
shidilrzf / Anti-exploration-RL
Anti exploration in offline reinforcement learning
☆11 · May 17, 2021 · Updated 5 years ago
tinalulu1327 / Cat_Recognition
Cat Detection and Breed Recognition
☆17 · Oct 27, 2018 · Updated 7 years ago
jyqhahah / rl_maddpg_matd3
☆14 · May 26, 2021 · Updated 5 years ago
EPFL-VILAB / palmer
PALMER: Perception-Action Loop with Memory for Long-Horizon Planning, NeurIPS 2022
☆15 · Dec 12, 2022 · Updated 3 years ago
anair13 / bullet-manipulation-affordances
☆13 · Jun 3, 2022 · Updated 4 years ago
wataruhashimoto52 / svgd_tf
Implementation of Stein Variational Gradient Descent with TensorFlow 2.0
☆12 · Sep 11, 2019 · Updated 6 years ago
ikostrikov / implicit_q_learning
☆332 · Jan 23, 2022 · Updated 4 years ago
minerllabs / basalt-benchmark
BASALT Benchmark datasets, evaluation code and agent training example.
☆22 · Nov 29, 2023 · Updated 2 years ago
gustmd0121 / Time_is_not_Enough
Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models
☆13 · Dec 5, 2024 · Updated last year
outsider86 / Residual-MPPI
The official implementation of Residual-MPPI
☆19 · Mar 22, 2025 · Updated last year