ertugrul-dmr / qwen2vl-captioner-guiLinks
☆20Updated last year
Alternatives and similar repositories for qwen2vl-captioner-gui
Users that are interested in qwen2vl-captioner-gui are comparing it to the libraries listed below
Sorting:
- A video clipper for Hunyuan video training.☆85Updated 2 months ago
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1☆82Updated last year
- ComfyUI Node for FlashFace☆69Updated 7 months ago
- This Flux latent upscaler workflow creates a lower-resolution initial pass, then advances to a second pass that upscales in latent space …☆117Updated last year
- Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!☆47Updated 4 months ago
- Gradio UI for training video models using finetrainers☆31Updated 5 months ago
- ComfyUI powertools for SD1.5 and SDXL model merging☆93Updated 6 months ago
- Extended Musubi Tuner with latent previews, fp16 accumulation, advanced cfg scheduling and more☆33Updated this week
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆62Updated 6 months ago
- An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and…☆136Updated this week
- ☆42Updated last year
- Various training scripts used to train bigasp☆102Updated last month
- This was orginally written by: https://github.com/hlky☆49Updated last year
- flux distillation and stuff☆119Updated 3 months ago
- ☆89Updated 5 months ago
- ☆49Updated 3 months ago
- For loading and running Pixtral models☆77Updated 8 months ago
- NNT Neural Network Toolkit Custom Nodes for ComfyUI☆68Updated 8 months ago
- The IMAGE-interrogator for SOTA image captioning☆85Updated last year
- Processes SafeTensors files for Stable Diffusion 1.5 (SD 1.5), Stable Diffusion XL (SDXL), and FLUX models. It extracts the UNet into a s…☆59Updated 10 months ago
- A tool to help adjust or zero-out Flux Block Weights and SAVE. I'm not a dev, so this implementation might be wrong.☆29Updated 10 months ago
- Adaptive ODE Solvers for ComfyUI☆53Updated last year
- ☆67Updated 4 months ago
- MoD Control Tile Upscaler for SDXL Pipeline☆61Updated 6 months ago
- ComfyUI Implementaion of ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆160Updated last year
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year
- ☆73Updated last year
- ☆27Updated 5 months ago
- Janky implementation of DiffuseHigh for ComfyUI☆36Updated 5 months ago
- Greatly increase the diversity of your generated images in Automatic1111 WebUI through Condition-Annealed Sampling.☆108Updated last year