ertugrul-dmr / qwen2vl-captioner-guiLinks
☆17Updated 9 months ago
Alternatives and similar repositories for qwen2vl-captioner-gui
Users that are interested in qwen2vl-captioner-gui are comparing it to the libraries listed below
Sorting:
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆61Updated 4 months ago
- ☆42Updated last year
- This Flux latent upscaler workflow creates a lower-resolution initial pass, then advances to a second pass that upscales in latent space …☆114Updated 10 months ago
- Set of auxiliary tools to use with image and video generation libaries. Mainly created to be used with diffusers☆59Updated this week
- ComfyUI Node for FlashFace☆67Updated 4 months ago
- Gradio UI for training video models using finetrainers☆30Updated 3 months ago
- A video clipper for Hunyuan video training.☆83Updated 3 months ago
- NNT Neural Network Toolkit Custom Nodes for ComfyUI☆69Updated 6 months ago
- Adaptive ODE Solvers for ComfyUI☆51Updated 11 months ago
- ☆99Updated 2 months ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year
- ☆125Updated 4 months ago
- ☆129Updated 2 weeks ago
- Various training scripts used to train bigasp☆90Updated last month
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1☆81Updated 10 months ago
- Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!☆47Updated last month
- This was orginally written by: https://github.com/hlky☆49Updated last year
- See original repo here: https://github.com/google/RB-Modulation - ICLR 2025 (Oral)☆125Updated 10 months ago
- Extended Musubi Tuner with latent previews, fp16 accumulation, advanced cfg scheduling and more☆23Updated 2 weeks ago
- Greatly increase the diversity of your generated images in Automatic1111 WebUI through Condition-Annealed Sampling.☆107Updated last year
- MoD Control Tile Upscaler for SDXL Pipeline☆59Updated 4 months ago
- Official implementation of "Normalized Attention Guidance"☆127Updated 2 weeks ago
- Batched Runge-Kutta Samplers for ComfyUI☆60Updated 11 months ago
- ☆57Updated 2 months ago
- ☆118Updated last year
- Janky implementation of DiffuseHigh for ComfyUI☆36Updated 2 months ago
- A ComfyUI implementation of Meta AI's AITemplate repo for faster inference using cpp/cuda.☆52Updated last year
- ☆73Updated 9 months ago
- ☆28Updated last year
- An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and…☆130Updated this week