Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
☆272Aug 21, 2024Updated last year
Alternatives and similar repositories for open-genie
Users that are interested in open-genie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆105Jan 23, 2025Updated last year
- A framework for training world models with virtual environments, complete with annotated environment dataset (RetroAct), exploration agen…☆72Oct 25, 2025Updated 6 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆140Jul 31, 2024Updated last year
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆254Apr 25, 2024Updated 2 years ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆517Jan 22, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- world modeling challenge for humanoid robots☆556Nov 8, 2024Updated last year
- ☆150Jul 8, 2025Updated 9 months ago
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,230Nov 9, 2025Updated 5 months ago
- Implementation of Retention-Network in PyTorch☆17Aug 12, 2023Updated 2 years ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆178Sep 23, 2025Updated 7 months ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆659Jul 1, 2025Updated 10 months ago
- Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)☆205Feb 28, 2024Updated 2 years ago
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆173Oct 1, 2025Updated 7 months ago
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation☆133Sep 8, 2025Updated 7 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [RSS 2024] Learning Manipulation by Predicting Interaction☆120Jul 2, 2025Updated 10 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆310Apr 22, 2024Updated 2 years ago
- ElasticTok: Adaptive Tokenization for Image and Video☆91Nov 4, 2024Updated last year
- Official implementation of GR-MG☆91Jan 12, 2025Updated last year
- The official codebase for running the experiments described in the AVDC paper.☆20Oct 2, 2024Updated last year
- ☆96Sep 4, 2024Updated last year
- ☆19Jun 26, 2024Updated last year
- Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.☆534Dec 6, 2024Updated last year
- Implementation of MagViT2 Tokenizer in Pytorch☆660Jan 12, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆370Jul 23, 2025Updated 9 months ago
- [TMLR 2024] Official PyTorch Implementation of Deep Dynamic Latent Particles☆17Mar 25, 2026Updated last month
- Modular PyTorch (Lightning) implementation of Diffusion Probabilistic Models☆22Mar 26, 2023Updated 3 years ago
- Pandora: Towards General World Model with Natural Language Actions and Video States☆536Sep 23, 2024Updated last year
- The official implementation of flow Q-learning (FQL)☆304Jul 21, 2025Updated 9 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆1,003Nov 25, 2025Updated 5 months ago
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆442Jan 6, 2026Updated 3 months ago
- DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control☆117Oct 27, 2024Updated last year
- A Video Tokenizer Evaluation Dataset☆156Jan 13, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆329May 22, 2025Updated 11 months ago
- Official Code Repo for GENIMA☆77Oct 29, 2025Updated 6 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆6,050Mar 23, 2025Updated last year
- ☆467Apr 14, 2026Updated 3 weeks ago
- Official Repository of "Transcrib3D: 3D Referring Expression Resolution through Large Language Models" accepted at IROS 2024☆12Mar 30, 2026Updated last month
- Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence☆1,450Jan 31, 2025Updated last year
- Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.☆1,641Jul 31, 2024Updated last year