apple / pico-banana-400kLinks
☆1,754Updated 3 weeks ago
Alternatives and similar repositories for pico-banana-400k
Users that are interested in pico-banana-400k are comparing it to the libraries listed below
Sorting:
- Official GitHub repository for FLUX.1 Krea [dev].☆358Updated 5 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,468Updated 3 weeks ago
- ☆494Updated last month
- ☆1,542Updated last month
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.☆421Updated 4 months ago
- Qwen-Image-Layered: Layered Decomposition for Inherent Editablity☆1,375Updated last week
- NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training☆290Updated 7 months ago
- Official inference repo for FLUX.2 models☆1,319Updated last month
- LL3M writes Python code that generates 3D assets in Blender.☆501Updated 2 months ago
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition☆672Updated last month
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆863Updated 4 months ago
- Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.☆937Updated this week
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆665Updated 2 months ago
- Code release for "LLMs can see and hear without any training"☆456Updated 8 months ago
- 🔥🔥 Open-sourced unified customization model☆1,200Updated 3 months ago
- 🌍 WorldGen - Generate Any 3D Scene in Seconds☆946Updated 2 months ago
- ☆1,257Updated last month
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…☆3,206Updated 3 months ago
- Cog inference for flux models☆367Updated 5 months ago
- Train high-quality text-to-image diffusion models in a data & compute efficient manner☆516Updated 9 months ago
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆3,028Updated 6 months ago
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆781Updated 6 months ago
- ☆377Updated 2 months ago
- Native Multimodal Models are World Learners☆1,394Updated last week
- A character-level language diffusion model trained on Tiny Shakespeare☆824Updated last week
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,593Updated 3 weeks ago
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆1,138Updated last week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆934Updated 7 months ago
- The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"☆549Updated last month
- HY-Motion model for 3D character animation generation.☆1,582Updated last week