apple / ml-starflowLinks
☆510Updated last week
Alternatives and similar repositories for ml-starflow
Users that are interested in ml-starflow are comparing it to the libraries listed below
Sorting:
- Official GitHub repository for FLUX.1 Krea [dev].☆360Updated 6 months ago
- Official PyTorch implementation of TokenSet.☆127Updated 10 months ago
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.☆427Updated 5 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆477Updated 2 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,507Updated last month
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition☆688Updated 2 months ago
- NVIDIA FastGen: Fast Generation from Diffusion Models☆508Updated last week
- ☆1,773Updated last month
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆131Updated 2 months ago
- ☆175Updated 3 months ago
- ☆317Updated 2 weeks ago
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆165Updated 6 months ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆114Updated last month
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆290Updated 8 months ago
- Code release for "LLMs can see and hear without any training"☆457Updated 9 months ago
- VIGA: Vision-as-Inverse-Graphics Agent☆720Updated last week
- OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.☆631Updated 3 months ago
- ☆278Updated last month
- The official GitHub Page for MiniMax☆62Updated 3 months ago
- A Unified Visual Generator with Interleaved OmniModal Context☆171Updated last month
- ☆109Updated last week
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆348Updated 3 weeks ago
- The official code of Yume☆607Updated 3 weeks ago
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance☆557Updated last month
- Animate Any Character in Any World☆89Updated last month
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation…☆302Updated last month
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆569Updated 2 months ago
- Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025☆122Updated 5 months ago
- Inference-time scaling of diffusion-based image and video generation models.☆172Updated last month
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆214Updated 3 months ago