eloialonso / diamondLinks
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
☆1,943Updated last year
Alternatives and similar repositories for diamond
Users that are interested in diamond are comparing it to the libraries listed below
Sorting:
- Inference script for Oasis 500M☆2,009Updated last year
- A suite of image and video neural tokenizers☆1,697Updated 11 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,547Updated 11 months ago
- A minimal implementation of DeepMind's Genie world model☆1,088Updated last month
- MineWorld: A Real-time interactive world model on Minecraft☆434Updated 5 months ago
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,593Updated 3 weeks ago
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,686Updated 4 months ago
- Mastering Diverse Domains through World Models☆2,630Updated 3 months ago
- Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)☆911Updated last year
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,129Updated 2 months ago
- A unified inference and post-training framework for accelerated video generation.☆2,898Updated last week
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,427Updated 10 months ago
- CVPR2025☆908Updated 7 months ago
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No…☆1,061Updated 2 months ago
- The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.☆705Updated 7 months ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,539Updated 7 months ago
- ☆1,164Updated last year
- [SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research☆1,519Updated 11 months ago
- [ICCV'25]DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion☆1,325Updated 2 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,918Updated last year
- Automating the Search for Artificial Life with Foundation Models!☆448Updated 2 months ago
- The best OSS video generation models, created by Genmo☆3,562Updated last month
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,691Updated last month
- Unifying 3D Mesh Generation with Language Models☆1,133Updated 9 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,884Updated 3 weeks ago
- 4M: Massively Multimodal Masked Modeling☆1,780Updated 7 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,476Updated 3 weeks ago
- MuJoCo fruit fly body model and locomotion RL tasks☆481Updated 5 months ago
- Interactive visualizations of the geometric intuition behind diffusion models.☆923Updated last week
- A general fine-tuning kit geared toward image/video/audio diffusion models.☆2,705Updated this week