eloialonso / diamondLinks
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
☆1,952Updated last year
Alternatives and similar repositories for diamond
Users that are interested in diamond are comparing it to the libraries listed below
Sorting:
- Inference script for Oasis 500M☆2,033Updated last year
- A suite of image and video neural tokenizers☆1,702Updated 11 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,548Updated last year
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,853Updated 5 months ago
- A minimal implementation of DeepMind's Genie world model☆1,118Updated 2 months ago
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,144Updated 2 months ago
- MineWorld: A Real-time interactive world model on Minecraft☆437Updated 5 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,485Updated 11 months ago
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No…☆1,063Updated 3 months ago
- A unified inference and post-training framework for accelerated video generation.☆3,028Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,081Updated last year
- [ICCV'25]DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion☆1,324Updated 3 months ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,549Updated 7 months ago
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,643Updated last month
- ☆1,170Updated last year
- ☆284Updated 4 months ago
- The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.☆727Updated 7 months ago
- 4M: Massively Multimodal Masked Modeling☆1,789Updated 7 months ago
- Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model☆1,822Updated 3 months ago
- Unifying 3D Mesh Generation with Language Models☆1,136Updated 10 months ago
- Mastering Diverse Domains through World Models☆2,730Updated 4 months ago
- Scalable and memory-optimized training of diffusion models☆1,327Updated 7 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation