etched-ai / open-oasis
Inference script for Oasis 500M
☆1,368Updated this week
Related projects ⓘ
Alternatives and complementary repositories for open-oasis
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 S…☆1,517Updated this week
- ☆1,958Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆1,829Updated 3 months ago
- ☆1,969Updated this week
- From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformer…☆2,030Updated 3 months ago
- ☆739Updated last week
- Text-to-Music Generation with Rectified Flow Transformers☆1,592Updated 2 months ago
- SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement☆1,167Updated last month
- Distributed Training Over-The-Internet☆683Updated 2 months ago
- The best OSS video generation models☆1,899Updated this week
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆565Updated last week
- [ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models☆424Updated 2 months ago
- Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation☆923Updated last week
- 4M: Massively Multimodal Masked Modeling☆1,603Updated last month
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆591Updated 3 weeks ago
- ☆1,965Updated 10 months ago
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,237Updated last week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,538Updated last month
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆443Updated 4 months ago
- DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos☆940Updated last week
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆2,420Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆801Updated 2 months ago
- A suite of image and video neural tokenizers☆695Updated this week
- DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion☆592Updated this week
- [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior☆2,036Updated 2 months ago
- Next-Token Prediction is All You Need☆1,801Updated 2 weeks ago
- InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models☆3,308Updated 4 months ago
- NanoGPT (124M) quality in 7.8 8xH100-minutes☆965Updated this week