eloialonso / diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
☆1,826 Updated 6 months ago
Alternatives and similar repositories for diamond
Users who are interested in diamond are comparing it to the libraries listed below.
- Inference script for Oasis 500M ☆1,852 Updated 7 months ago
- PyTorch code and models for VJEPA2 self-supervised learning from video. ☆1,522 Updated last week
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,070 Updated 3 weeks ago
- A suite of image and video neural tokenizers ☆1,636 Updated 4 months ago
- [SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research ☆1,372 Updated 5 months ago
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion" ☆898 Updated 2 months ago
- The best OSS video generation models ☆3,231 Updated 5 months ago
- The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks. ☆616 Updated 2 weeks ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models ☆1,327 Updated 3 weeks ago
- Unifying 3D Mesh Generation with Language Models ☆1,058 Updated 2 months ago
- 4M: Massively Multimodal Masked Modeling ☆1,740 Updated 3 weeks ago
- A general fine-tuning kit geared toward diffusion models. ☆2,386 Updated last week
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No… ☆961 Updated 2 weeks ago
- ☆295 Updated last month
- Roblox Foundation Model for 3D Intelligence ☆738 Updated last month
- Official repository for our work on micro-budget training of large-scale diffusion models. ☆1,481 Updated 5 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,300 Updated 3 weeks ago
- [CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering ☆2,834 Updated 8 months ago
- Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024) ☆896 Updated 10 months ago
- [ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation. ☆1,913 Updated 10 months ago
- MuJoCo fruit fly body model and locomotion RL tasks ☆439 Updated last month
- Pretraining code for a large-scale depth-recurrent language model ☆783 Updated 2 weeks ago
- Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes". ☆1,455 Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,113 Updated 7 months ago
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control ☆754 Updated this week
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,102 Updated 4 months ago
- Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model. ☆1,918 Updated last year
- DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion ☆1,252 Updated 6 months ago
- SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images ☆835 Updated last month
- VideoSys: An easy and efficient system for video generation ☆1,980 Updated 3 months ago