apple / pico-banana-400kLinks
☆1,719Updated last month
Alternatives and similar repositories for pico-banana-400k
Users that are interested in pico-banana-400k are comparing it to the libraries listed below
Sorting:
- Official GitHub repository for FLUX.1 Krea [dev].☆358Updated 4 months ago
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.☆418Updated 3 months ago
- ☆400Updated 2 weeks ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,416Updated last month
- ☆1,399Updated 3 weeks ago
- Official inference repo for FLUX.2 models☆1,170Updated last week
- NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training☆290Updated 6 months ago
- LL3M writes Python code that generates 3D assets in Blender.☆497Updated last month
- Code release for "LLMs can see and hear without any training"☆454Updated 7 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆661Updated last month
- 🌍 WorldGen - Generate Any 3D Scene in Seconds☆912Updated last month
- Cog inference for flux models☆367Updated 4 months ago
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition☆646Updated 2 weeks ago
- 🔥🔥 Open-sourced unified customization model☆1,194Updated 3 months ago
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆776Updated 5 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆687Updated 5 months ago
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"☆243Updated last week
- ☆367Updated last month
- A character-level language diffusion model trained on Tiny Shakespeare☆594Updated 3 weeks ago
- Train high-quality text-to-image diffusion models in a data & compute efficient manner☆511Updated 8 months ago
- ☆1,229Updated 3 weeks ago
- Qwen Image models through MPS☆243Updated 3 weeks ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆762Updated 3 months ago
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆1,031Updated last week
- Pusa: Thousands Timesteps Video Diffusion Model☆666Updated 3 months ago
- MotionStream: Real-Time Video Generation with Interactive Motion Controls☆423Updated last month
- Native Multimodal Models are World Learners☆1,342Updated 2 weeks ago
- CVPR2025☆905Updated 6 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆927Updated 6 months ago
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,503Updated last week