apple / ml-gbcLinks
☆101Updated 5 months ago
Alternatives and similar repositories for ml-gbc
Users that are interested in ml-gbc are comparing it to the libraries listed below
Sorting:
- Model code for inferencing T5☆65Updated 3 months ago
- A comprehensive codebase for training and finetuning Image <> Latent models.☆37Updated 3 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆40Updated 2 months ago
- Official PyTorch implementation of TokenSet.☆121Updated 3 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆113Updated 3 months ago
- ☆45Updated 7 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆107Updated 2 months ago
- Official Implementation of weights2weights☆143Updated 3 months ago
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆133Updated 8 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆45Updated 2 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆87Updated 3 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆62Updated 3 months ago
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation☆118Updated 5 months ago
- ☆33Updated 7 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 7 months ago
- ☆70Updated 8 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆199Updated last week
- finetune your florence2 model easy☆20Updated 11 months ago
- ☆72Updated last month
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆118Updated this week
- faster parallel inference of mochi-1 video generation model☆121Updated 4 months ago
- A unified media (Image, Video, Audio, Text) diffusion repository, for education and learning.☆21Updated 2 months ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- ☆70Updated 7 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Updated 5 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆69Updated 7 months ago
- ☆102Updated this week
- Inference-time scaling of diffusion-based image and video generation models.☆151Updated 3 months ago
- ☆84Updated 10 months ago