apple / pico-banana-400kLinks
☆1,632Updated 3 weeks ago
Alternatives and similar repositories for pico-banana-400k
Users that are interested in pico-banana-400k are comparing it to the libraries listed below
Sorting:
- Official GitHub repository for FLUX.1 Krea [dev].☆355Updated 3 months ago
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.☆416Updated 2 months ago
- ☆1,300Updated this week
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,344Updated 3 weeks ago
- NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training☆289Updated 5 months ago
- LL3M writes Python code that generates 3D assets in Blender.☆488Updated last month
- Cog inference for flux models☆367Updated 3 months ago
- Code release for "LLMs can see and hear without any training"☆452Updated 6 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆656Updated last month
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition☆616Updated last month
- ☆357Updated 2 weeks ago
- 🔥🔥 Open-sourced unified customization model☆1,181Updated 2 months ago
- Cog wrapper for ostris/ai-toolkit + post-finetuning cog inference for flux models☆428Updated 5 months ago
- Qwen Image models through MPS☆231Updated this week
- ComfyDeployed☆426Updated 2 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆684Updated 5 months ago
- 🌍 WorldGen - Generate Any 3D Scene in Seconds☆798Updated last week
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆972Updated last month
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…☆2,921Updated last month
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆450Updated 8 months ago
- [ICCV 2025] Official pytorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"☆401Updated 8 months ago
- CVPR2025☆903Updated 6 months ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆725Updated 2 months ago
- [NeurIPS'25 Spotlight] Official repository for "Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment…☆739Updated last month
- A unified inference and post-training framework for accelerated video generation.☆2,601Updated this week
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆360Updated 5 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆661Updated 2 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆489Updated 3 weeks ago
- Train high-quality text-to-image diffusion models in a data & compute efficient manner☆509Updated 7 months ago
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,434Updated 3 weeks ago