DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
☆1,979 · Dec 6, 2024 · Updated last year
Alternatives and similar repositories for diamond
Users that are interested in diamond are comparing it to the libraries listed below
- Inference script for Oasis 500M ☆2,055 · Nov 8, 2024 · Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆119 · Sep 22, 2024 · Updated last year
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion" ☆1,169 · Nov 9, 2025 · Updated 3 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆3,161 · Dec 21, 2024 · Updated last year
- A suite of image and video neural tokenizers ☆1,711 · Feb 11, 2025 · Updated last year
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos ☆472 · Mar 22, 2025 · Updated 11 months ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆869 · Oct 14, 2024 · Updated last year
- world modeling challenge for humanoid robots ☆554 · Nov 8, 2024 · Updated last year
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,969 · Feb 24, 2026 · Updated last week
- The best OSS video generation models, created by Genmo ☆3,611 · Nov 14, 2025 · Updated 3 months ago
- Mastering Diverse Domains through World Models ☆2,854 · Sep 23, 2025 · Updated 5 months ago
- ☆323 · May 22, 2025 · Updated 9 months ago
- Next-Token Prediction is All You Need ☆2,355 · Jan 12, 2026 · Updated last month
- New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos ☆8,082 · Jan 6, 2026 · Updated last month
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆12,449 · Nov 4, 2025 · Updated 3 months ago
- A generative world for general-purpose robotics & embodied AI learning. ☆28,186 · Updated this week
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory ☆338 · Feb 21, 2026 · Updated last week
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control" ☆759 · May 21, 2025 · Updated 9 months ago
- Pandora: Towards General World Model with Natural Language Actions and Video States ☆532 · Sep 23, 2024 · Updated last year
- [ICCV'25] DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion ☆1,328 · Oct 17, 2025 · Updated 4 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,252 · Feb 16, 2025 · Updated last year
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think ☆1,553 · Mar 16, 2025 · Updated 11 months ago
- Official repository for LTX-Video ☆9,367 · Jan 5, 2026 · Updated last month
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,550 · Feb 27, 2025 · Updated last year
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning ☆21,780 · Updated this week
- Official inference repo for FLUX.1 models ☆25,225 · Jul 31, 2025 · Updated 7 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach ☆634 · Jul 1, 2024 · Updated last year
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou… ☆3,766 · Nov 28, 2025 · Updated 3 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆18,560 · Dec 25, 2024 · Updated last year
- Scalable and memory-optimized training of diffusion models ☆1,338 · Jun 4, 2025 · Updated 8 months ago
- MineWorld: A Real-time interactive world model on Minecraft ☆452 · Aug 6, 2025 · Updated 6 months ago
- Large World Model -- Modeling Text and Video with Millions Context ☆7,399 · Oct 19, 2024 · Updated last year
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod… ☆8,626 · Nov 10, 2025 · Updated 3 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,936 · Aug 15, 2024 · Updated last year
- ☆126 · Nov 25, 2025 · Updated 3 months ago
- A unified inference and post-training framework for accelerated video generation. ☆3,111 · Updated this week
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆8,382 · May 31, 2024 · Updated last year
- DUSt3R: Geometric 3D Vision Made Easy ☆6,975 · Sep 24, 2025 · Updated 5 months ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion" ☆630 · Jul 1, 2025 · Updated 8 months ago