Python scripts to use for captioning images with VLMs
☆45Apr 23, 2025Updated 11 months ago
Alternatives and similar repositories for VLM-Captioning-Tools
Users who are interested in VLM-Captioning-Tools are comparing it to the libraries listed below.
- Using CogVLM and CogAgent for image captioning☆15Dec 29, 2023Updated 2 years ago
- ☆13Feb 2, 2024Updated 2 years ago
- ☆21Jul 15, 2025Updated 8 months ago
- Custom LoRA training on DynamiCrafter☆18Jul 26, 2024Updated last year
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Aug 1, 2022Updated 3 years ago
- Extension/Script for Stable Diffusion UI by AUTOMATIC1111 https://github.com/AUTOMATIC1111/stable-diffusion-webui☆17Dec 19, 2022Updated 3 years ago
- Gradio UI for training video models using finetrainers☆33Apr 18, 2025Updated 11 months ago
- ☆11Jun 1, 2016Updated 9 years ago
- A funny extension that integrates image browsing, downloading, deduplication, and clustering, and can quickly collect, classify and process your im…☆24Aug 8, 2023Updated 2 years ago
- ☆27Sep 28, 2024Updated last year
- XGEN-MM(BLIP3) Autocaptioning Tools☆17Jun 20, 2024Updated last year
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆32Mar 10, 2026Updated last month
- A convenient and user-friendly image data processing library that integrates various advanced image processing models.☆14May 14, 2025Updated 11 months ago
- ComfyUI powertools for SD1.5 and SDXL model merging☆96Mar 27, 2025Updated last year
- A basic pure PyTorch implementation of flash attention☆16Oct 28, 2024Updated last year
- A sd-webui extension for utilizing DanTagGen to "upsample prompts".☆13Jun 13, 2024Updated last year
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆33Jul 14, 2023Updated 2 years ago
- The program used to occupy GPUs.☆10Mar 24, 2023Updated 3 years ago
- An AI-powered image dataset captioning tool☆26Feb 21, 2026Updated last month
- Snoopy v2.0.1 - Automated digital terrestrial tracking framework☆11Feb 10, 2017Updated 9 years ago
- Implementation of "Denoise Pretraining on Non-equilibrium Molecular Conformations for Accurate and Transferable Neural Potentials" in PyT…☆14Jul 26, 2023Updated 2 years ago
- ☆19Feb 4, 2026Updated 2 months ago
- ☆12Aug 14, 2019Updated 6 years ago
- Fine-tuning code for CLIP models☆271Jan 28, 2026Updated 2 months ago
- ☆20Feb 3, 2025Updated last year
- ✨ waifu-diffusion tagger server / onnx | wd-tagger as api service☆20Feb 20, 2025Updated last year
- A small project that uses Discrete Denoising Diffusion Probabilistic Models (D3PMs), a generative model for discrete data that builds upo…☆15Aug 10, 2024Updated last year
- This repository focuses on the use of keystroke dynamics as a behavioral biometric to build machine learning models for user recognition.…☆11Mar 29, 2023Updated 3 years ago
- ☆52Jan 6, 2026Updated 3 months ago
- The Florence Tool CLI provides a command-line interface for processing images using the Florence-2 model. This tool allows users to apply…☆16Jan 21, 2025Updated last year
- Can't time series forecasting be done with attention?☆10Apr 30, 2021Updated 4 years ago
- ☆53Jun 24, 2025Updated 9 months ago
- Various training scripts used to train bigasp☆113Aug 13, 2025Updated 8 months ago
- Lingotion Thespeon AI-acting engine for Unity☆17Mar 11, 2026Updated last month
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing".☆100May 30, 2025Updated 10 months ago
- Official code for the CVPR 2024 Paper "Can Biases in ImageNet Models Explain Generalization?".☆13Jun 24, 2024Updated last year
- Awesome GAN-based Image Restoration☆12Mar 11, 2024Updated 2 years ago
- ☆17Jan 29, 2025Updated last year
- Software to automatically tag images. Its primary use is for training Stable Diffusion checkpoints and LoRAs.☆24Dec 4, 2025Updated 4 months ago