jrobine / twm
Transformer-based World Models
☆66 Updated last year
Related projects:
- ☆43 Updated 3 months ago
- Official code repository for Prompt-DT. ☆93 Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆71 Updated 9 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆87 Updated 3 months ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291. ☆90 Updated last year
- ☆69 Updated 2 years ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆81 Updated 10 months ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆68 Updated last month
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) ☆34 Updated 11 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆58 Updated last year
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆71 Updated 4 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar… ☆104 Updated last year
- ☆46 Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy ☆74 Updated 10 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆49 Updated 11 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements … ☆52 Updated 3 months ago
- Pytorch version of Dreamer, which follows the original TF v2 codes. ☆112 Updated 2 years ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆65 Updated 5 months ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆59 Updated 2 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆31 Updated 6 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆53 Updated 3 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆79 Updated this week
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024). ☆36 Updated 7 months ago
- ☆20 Updated 11 months ago
- Masked World Models for Visual Control ☆114 Updated last year
- Synthetic Experience Replay ☆62 Updated 3 months ago
- Repo for Implicit Diffusion Q-Learning ☆85 Updated 9 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆108 Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆78 Updated last year
- ☆71 Updated last year