eloialonso / diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
☆1,777Updated 4 months ago
Alternatives and similar repositories for diamond:
Users that are interested in diamond are comparing it to the libraries listed below
- Inference script for Oasis 500M☆1,790Updated 5 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,370Updated 2 months ago
- The best OSS video generation models☆3,070Updated 3 months ago
- ☆2,882Updated 3 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆3,900Updated this week
- Witness the aha moment of VLM with less than $3.☆3,500Updated last month
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,559Updated this week
- FastVideo is a lightweight framework for accelerating large video diffusion models.☆1,317Updated this week
- A suite of image and video neural tokenizers☆1,596Updated last month
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,880Updated 3 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆1,424Updated this week
- [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior☆2,895Updated 7 months ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,146Updated last week
- Distributed Training Over-The-Internet☆895Updated 4 months ago
- High-resolution models for human tasks.☆4,926Updated 4 months ago
- CVPR2025☆814Updated 2 weeks ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆1,973Updated 8 months ago
- A general fine-tuning kit geared toward diffusion models.☆2,185Updated this week
- Simple and readable code for training and sampling from diffusion models☆468Updated 3 months ago
- ☆1,112Updated 3 months ago
- ☆954Updated 5 months ago
- [ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆814Updated last week
- DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion☆1,223Updated 4 months ago
- The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.☆566Updated last month
- SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images☆758Updated last month
- 4M: Massively Multimodal Masked Modeling☆1,709Updated last month
- Code for BLT research paper☆1,442Updated last week
- SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement☆1,423Updated 2 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,281Updated last week
- ☆3,271Updated last month