Python scripts to use for captioning images with VLMs
☆47Apr 23, 2025Updated last year
Alternatives and similar repositories for VLM-Captioning-Tools
Users that are interested in VLM-Captioning-Tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Feb 2, 2024Updated 2 years ago
- Custom LORA training on DynamiCrafter☆18Jul 26, 2024Updated last year
- Gradio UI for training video models using finetrainers☆34Apr 18, 2025Updated last year
- A funny extension that integrates image-browsing , downloader , deduplicate , cluster , can quickly collect, classify and process your im…☆24Aug 8, 2023Updated 2 years ago
- This repository is the official PyTorch implementation of MA-AGIQA:Large Multi-modality Model Assisted AI-Generated Image Quality Assessm…☆27Sep 28, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- XGEN-MM(BLIP3) Autocaptioning Tools☆17Jun 20, 2024Updated 2 years ago
- A convenient and user-friendly image data processing library that integrates various advanced image processing models.☆15May 14, 2025Updated last year
- ComfyUI powertools for SD1.5 and SDXL model merging☆97Mar 27, 2025Updated last year
- A tool to visualize convolutional layer activations on an input image.☆12Jun 5, 2016Updated 10 years ago
- A sd-webui extension for utilizing DanTagGen to "upsample prompts".☆13Jun 13, 2024Updated 2 years ago
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆33Jul 14, 2023Updated 2 years ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆36Mar 10, 2026Updated 3 months ago
- The program used to occupy GPUs.☆10Mar 24, 2023Updated 3 years ago
- An AI-powered image dataset captioning tool☆31Feb 21, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2025 Spotlight] Official implementation for DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing☆32Jan 23, 2026Updated 5 months ago
- ☆20Jun 2, 2026Updated last month
- ☆12Aug 14, 2019Updated 6 years ago
- Fine-tuning code for CLIP models☆275Jun 9, 2026Updated 3 weeks ago
- ☆21Feb 3, 2025Updated last year
- ☆52Jan 6, 2026Updated 5 months ago
- The Florence Tool CLI provides a command-line interface for processing images using the Florence-2 model. This tool allows users to apply…☆16Jan 21, 2025Updated last year
- attention으로 시계열 예측은 할 수 없을까☆10Apr 30, 2021Updated 5 years ago
- ☆55Jun 24, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Various training scripts used to train bigasp☆113Aug 13, 2025Updated 10 months ago
- Lingotion Thespeon AI-acting engine for Unity☆20Jun 24, 2026Updated last week
- Minimal unofficial implementation of Consistency Trajectory models on a 1D toy task.☆22Mar 11, 2024Updated 2 years ago
- 大学Latex答辩模版,当前包含川大、哈工大、中科大。☆11Jul 22, 2024Updated last year
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing".☆99May 30, 2025Updated last year
- Official code for the CVPR 2024 Paper "Can Biases in ImageNet Models Explain Generalization?".☆13Jun 24, 2024Updated 2 years ago
- Awesome GAN-based Image Restoration☆12Mar 11, 2024Updated 2 years ago
- ☆17Jan 29, 2025Updated last year
- A software to automatically tag images. It's primary use is for training Stable Diffusion checkpoints and loras.☆24Dec 4, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- unpbook == Unix Networking Programming☆12Apr 29, 2018Updated 8 years ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- Official repository for CVPR 2025 paper: OpenSDI: Spotting Diffusion-Generated Images in the Open World☆44Jun 18, 2026Updated 2 weeks ago
- [Unofficial] RF Inversion implemented for SD3 / SD3.5☆13Nov 4, 2024Updated last year
- MultiPriv offers multilingual, multimodal PII entities and prompts for studying privacy risks in LLMs/VLMs. It also supports broader PII-…☆32Updated this week
- RealESRGAN high order degradation pipeline☆11Mar 20, 2025Updated last year
- Collect papers and codes about VQGAN in various Computer Vision tasks☆10Dec 20, 2022Updated 3 years ago