apple / ml-starflow
☆497 Updated last month
Alternatives and similar repositories for ml-starflow
Users that are interested in ml-starflow are comparing it to the libraries listed below
- Official GitHub repository for FLUX.1 Krea [dev]. ☆359 Updated 5 months ago
- Official PyTorch implementation of TokenSet. ☆127 Updated 9 months ago
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content. ☆422 Updated 4 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction. ☆1,476 Updated last month
- ☆1,754 Updated last month
- Krea Realtime 14B. An open-source realtime AI video model. ☆449 Updated 2 months ago
- Code release for "LLMs can see and hear without any training" ☆458 Updated 8 months ago
- ☆108 Updated this week
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction" ☆165 Updated 6 months ago
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition ☆673 Updated last month
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆344 Updated last week
- ☆171 Updated 2 months ago
- Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation" ☆285 Updated last month
- OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language. ☆621 Updated 2 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation ☆669 Updated 3 months ago
- The official code of Yume ☆578 Updated this week
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices ☆130 Updated last month
- NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intellige… ☆594 Updated 3 weeks ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆560 Updated last month
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance ☆519 Updated last week
- ☆304 Updated this week
- Cosmos-Transfer1-DiffusionRenderer: High-quality video de-lighting and re-lighting based on Cosmos video diffusion framework ☆765 Updated 3 months ago
- DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models ☆150 Updated 2 weeks ago
- faster parallel inference of mochi-1 video generation model ☆126 Updated 10 months ago
- ☆265 Updated last week
- HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency ☆952 Updated this week
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation… ☆295 Updated last week
- The official GitHub Page for MiniMax ☆60 Updated 2 months ago
- ☆1,553 Updated 2 months ago
- A minimal implementation of DeepMind's Genie world model ☆1,097 Updated last month