ertugrul-dmr/qwen2vl-captioner-gui

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ertugrul-dmr/qwen2vl-captioner-gui)

ertugrul-dmr / qwen2vl-captioner-gui

☆21

Alternatives and similar repositories for qwen2vl-captioner-gui

Users that are interested in qwen2vl-captioner-gui are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fpgaminer / bigasp-training
View on GitHub
Various training scripts used to train bigasp
☆113Aug 13, 2025Updated 11 months ago
PasiKoodaa / ACE-Step-RADIO
View on GitHub
ACE-Step: A Step Towards Music Generation Foundation Model
☆50May 20, 2025Updated last year
Pavansomisetty21 / Image-Caption-Generation-using-LLMs-GEMINI-
View on GitHub
we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI
☆10Aug 24, 2024Updated last year
mit-han-lab / VisCompare
View on GitHub
A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders
☆26Feb 21, 2025Updated last year
regiellis / ComfyUI-EasyPony
View on GitHub
Easy Pony is a helper node that simplifies the process of adding scoring and other attributes to the core when prompting with Pony models…
☆11Apr 5, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
alpertunga-bile / image-caption-comfyui
View on GitHub
Using image caption models to extract prompts in ComfyUI
☆12May 21, 2025Updated last year
g588928812 / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆11Jul 22, 2023Updated 3 years ago
bigdata-pw / florence-tool
View on GitHub
The Florence Tool CLI provides a command-line interface for processing images using the Florence-2 model. This tool allows users to apply…
☆16Jan 21, 2025Updated last year
IDGallagher / ComfyUI-IG-Motion-I2V
View on GitHub
ComfyUI implementation of Motion-I2V
☆41Sep 30, 2024Updated last year
christian-byrne / img2colors-comfyui-node
View on GitHub
Extract dominant or complementary color palettes from images. Convert colors to English names suitable for txt2img prompts.
☆16Jan 5, 2025Updated last year
Adamsw72 / whisper-standalone-win-simpleGUI
View on GitHub
An easy-to-use GUI addon for whisper-standalone-win. Designed for those who prefer a simple interface over typing commands and file paths…
☆13Dec 26, 2023Updated 2 years ago
danopdev / VideoFrame
View on GitHub
Extract individual frames from a video as png images (android)
☆13Dec 30, 2022Updated 3 years ago
overcrash66 / OpenTranslator
View on GitHub
Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features
☆18Mar 26, 2026Updated 4 months ago
SonicCodes / subcloning
View on GitHub
implementation of https://arxiv.org/pdf/2312.09299
☆21Jul 3, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SleeeepyZhou / VLMCaption-TagCraft
View on GitHub
Image caption and manage tool for AI training
☆11Jan 24, 2025Updated last year
Karmabu / ComfyUI-Installer-GUI
View on GitHub
Windows ComfyUI Installer GUI
☆11Mar 30, 2025Updated last year
BaofengZan / mnn-llm-GOT-OCR2.0
View on GitHub
使用mnn-llm对GOT-OCR2.0进行推理
☆14Oct 2, 2024Updated last year
Auryg / Ideogram-Json-Captioner
View on GitHub
A program to help with Ideogram captioning
☆30Updated this week
brianGit78 / josh_crib_check
View on GitHub
☆13Feb 5, 2026Updated 5 months ago
doveg / whisper-real-time
View on GitHub
A real time offline transcriber with gui, based on OpenAI whisper
☆17Dec 25, 2025Updated 7 months ago
ycyy / ComfyUI-Yolo-World-EfficientSAM
View on GitHub
ComfyUI Yolo World EfficientSAM custom node
☆15Jul 16, 2024Updated 2 years ago
SkunkworksAI / CodeFusion
View on GitHub
☆14Oct 31, 2023Updated 2 years ago
huggingface / feel
View on GitHub
☆15May 26, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
MNeMoNiCuZ / joy-caption-batch
View on GitHub
A batch captioning tool for joy_caption
☆196Aug 25, 2025Updated 11 months ago
Khalil-Rehman9 / CaptionAI
View on GitHub
A powerful and user-friendly tool that generates detailed captions for your images
☆21Nov 11, 2024Updated last year
deepbeepmeep / FluxFillGP
View on GitHub
Flux Fill 1.0 GO: flux Inpainting and outpainting starting with 8Gb of VRAM
☆78Jan 18, 2025Updated last year
lujiazho / AI_TryOn_mini
View on GitHub
An AI try-on application for generating photos with AI character wearing the same clothes as the one in the input photo.
☆14Sep 7, 2023Updated 2 years ago
shoutsid / townhall
View on GitHub
A Python-based chatbot project built on the autogen and tinygrad foundation, utilizing advanced agents for dynamic conversations and func…
☆27Oct 9, 2024Updated last year
silence-tang / GaussianActor
View on GitHub
[AAAI 2025] Official Implementation of 3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling
☆15Mar 30, 2025Updated last year
Pixelailabs / Save_Florence2_Bulk_Prompts
View on GitHub
☆17Nov 6, 2025Updated 8 months ago
KoelLabs / koellabs.com
View on GitHub
Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…
☆14Jul 13, 2026Updated 2 weeks ago
mweiherer / irbsm
View on GitHub
Official implementation of "iRBSM: A Deep Implicit 3D Breast Shape Model" (BVM'25).
☆15Dec 2, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yolanother / DTAIImageToTextNode
View on GitHub
A ComfyUI node for describing an image
☆20May 22, 2024Updated 2 years ago
lukassteinwender / avatair
View on GitHub
A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.
☆18Jun 12, 2026Updated last month
GeekyGhost / Automatic1111-Geeky-Remb
View on GitHub
Automatic1111 port of my comfyUI geely remb tool
☆17Oct 24, 2024Updated last year
IST-DASLab / peft-rosa
View on GitHub
A fork of the PEFT library, supporting Robust Adaptation (RoSA)
☆15Aug 16, 2024Updated last year
bycloudai / CVPR2022-DaGAN-Windows
View on GitHub
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
☆23Sep 14, 2022Updated 3 years ago
agnJason / RGB-D-PIFuHD
View on GitHub
body reconstruction
☆16Jun 23, 2021Updated 5 years ago
robot-love / depth_from_video_in_the_wild
View on GitHub
SYDE 671 final project source code, copied from Google Research to avoid cloning the entire Google Research repo.
☆14Nov 17, 2019Updated 6 years ago