vmicheli / delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆73 Updated 2 months ago
Related projects:
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. ☆206 Updated last week
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆41 Updated 3 months ago
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023 ☆42 Updated 4 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code ☆20 Updated 2 weeks ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second ☆81 Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆84 Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations ☆18 Updated 2 months ago
- Efficient baselines for autocurricula in JAX. ☆165 Updated 3 weeks ago
- PyTorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024). ☆49 Updated 3 weeks ago
- Using advances in generative modeling to learn reward functions from unlabeled videos. ☆106 Updated 7 months ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆145 Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆79 Updated this week
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆34 Updated 9 months ago
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments" ☆27 Updated last week
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024 ☆53 Updated 5 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ) ☆19 Updated 3 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆46 Updated 3 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆117 Updated 10 months ago
- BASALT Benchmark datasets, evaluation code and agent training example. ☆19 Updated 9 months ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference" ☆35 Updated 2 months ago
- Minimal but scalable implementation of large language models in JAX ☆17 Updated 3 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆87 Updated last month
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆81 Updated 10 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies) ☆68 Updated last month
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆82 Updated 9 months ago
- We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effe… ☆17 Updated 7 months ago