KunpengSong / MoMA-inactive
[inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
☆13 Updated last year
Alternatives and similar repositories for MoMA-inactive
Users who are interested in MoMA-inactive are comparing it to the libraries listed below.
- ☆127 Updated 9 months ago
- See original repo here: https://github.com/google/RB-Modulation - ICLR 2025 (Oral) ☆125 Updated 10 months ago
- Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning ☆35 Updated last year
- ☆42 Updated last year
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing". ☆96 Updated last month
- Explore how Flux Dev responds when you change the strengths of layers in the model. ☆20 Updated 9 months ago
- Official repo for DiffArtist (ACM MM 2025) ☆121 Updated last week
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization. ☆64 Updated last year
- ☆91 Updated last year
- Various training scripts used to train bigasp ☆89 Updated last month
- IP Adapter Instruct ☆206 Updated 11 months ago
- SliderSpace: Decomposing the Visual Capabilities of Diffusion Models ☆97 Updated 4 months ago
- ☆44 Updated 6 months ago
- ControlAnimate Library ☆48 Updated last year
- ☆125 Updated 4 months ago
- ☆127 Updated 2 weeks ago
- Official implementation of "Normalized Attention Guidance" ☆127 Updated 2 weeks ago
- MoD Control Tile Upscaler for SDXL Pipeline ☆59 Updated 4 months ago
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1 ☆81 Updated 10 months ago
- ☆247 Updated last year
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With … ☆81 Updated 7 months ago
- ☆99 Updated 2 months ago
- 🔬 Visualize attention layers from Stable Diffusion ☆85 Updated 3 months ago
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images. ☆60 Updated 7 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation ☆232 Updated last year
- Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues! ☆47 Updated last month
- Gradio UI for training video models using finetrainers ☆30 Updated 2 months ago
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP ☆61 Updated 4 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆86 Updated last year
- Official implementation of ID-Aligner ☆121 Updated last year