eloialonso / diamondLinks
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
☆1,929Updated last year
Alternatives and similar repositories for diamond
Users that are interested in diamond are comparing it to the libraries listed below
Sorting:
- Inference script for Oasis 500M☆1,998Updated last year
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,542Updated 11 months ago
- A minimal implementation of DeepMind's Genie world model☆1,062Updated last month
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,640Updated this week
- A suite of image and video neural tokenizers☆1,692Updated 10 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,831Updated this week
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No…☆1,055Updated 2 months ago
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,105Updated last month
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,520Updated 6 months ago
- The best OSS video generation models, created by Genmo☆3,537Updated last month
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,910Updated last year
- MineWorld: A Real-time interactive world model on Minecraft☆423Updated 4 months ago
- Mastering Diverse Domains through World Models☆2,544Updated 2 months ago
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,524Updated 2 weeks ago
- [SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research☆1,509Updated 11 months ago
- ☆1,152Updated last year
- ☆273Updated 3 months ago
- Unifying 3D Mesh Generation with Language Models☆1,131Updated 8 months ago
- The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.☆699Updated 6 months ago
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,556Updated 3 months ago
- CVPR2025☆904Updated 7 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,141Updated last year
- Scalable and memory-optimized training of diffusion models☆1,312Updated 6 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,663Updated 3 weeks ago
- [ICCV'25]DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion☆1,318Updated 2 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,324Updated 9 months ago
- ☆316Updated 7 months ago
- ☆1,403Updated 11 months ago
- Simple and readable code for training and sampling from diffusion models☆671Updated 6 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,068Updated last year