apple / ml-starflow
☆474 · Updated 3 weeks ago
Alternatives and similar repositories for ml-starflow
Users who are interested in ml-starflow are comparing it to the repositories listed below.
- Official PyTorch implementation of TokenSet. ☆127 · Updated 9 months ago
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content. ☆420 · Updated 4 months ago
- Official GitHub repository for FLUX.1 Krea [dev]. ☆357 · Updated 4 months ago
- Krea Realtime 14B. An open-source realtime AI video model. ☆423 · Updated last month
- Code release for "LLMs can see and hear without any training" ☆454 · Updated 7 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction. ☆1,446 · Updated last week
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆542 · Updated last month
- ☆156 · Updated this week
- faster parallel inference of mochi-1 video generation model ☆126 · Updated 10 months ago
- OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language. ☆607 · Updated last month
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices ☆128 · Updated 3 weeks ago
- ☆166 · Updated 2 weeks ago
- ☆1,734 · Updated last week
- ☆105 · Updated 2 weeks ago
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction" ☆161 · Updated 5 months ago
- ☆345 · Updated 4 months ago
- NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training ☆290 · Updated 6 months ago
- Qwen-Image-Layered: Layered Decomposition for Inherent Editability ☆352 · Updated this week
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆333 · Updated 2 months ago
- Large multi-modal models (L3M) pre-training. ☆223 · Updated 3 months ago
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning ☆277 · Updated last month
- The official GitHub Page for MiniMax ☆60 · Updated last month
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition ☆662 · Updated 3 weeks ago
- Code for Bolmo: Byteifying the Next Generation of Language Models ☆66 · Updated last week
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation ☆99 · Updated 2 weeks ago
- Inference-time scaling of diffusion-based image and video generation models. ☆172 · Updated last week
- High-throughput tensor loading for PyTorch ☆213 · Updated 3 weeks ago
- ☆78 · Updated 7 months ago
- GRadient-INformed MoE ☆265 · Updated last year
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using … ☆178 · Updated last week