☆21Sep 28, 2024Updated last year
Alternatives and similar repositories for qwen2vl-captioner-gui
Users that are interested in qwen2vl-captioner-gui are comparing it to the libraries listed below
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆25Feb 21, 2025Updated last year
- ☆35Oct 9, 2025Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- ☆10Jan 23, 2025Updated last year
- speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆11Dec 7, 2025Updated 3 months ago
- ☆22Dec 23, 2025Updated 2 months ago
- Generate music videos starring yourself.☆11Apr 3, 2025Updated 11 months ago
- Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features☆14Updated this week
- ☆23Dec 11, 2025Updated 2 months ago
- Various training scripts used to train bigasp☆113Aug 13, 2025Updated 6 months ago
- ComfyUI implementation of Motion-I2V☆41Sep 30, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Image Text Segmentation using FAST corner detection and DBSCAN clustering with k-d tree data structure☆14Feb 27, 2019Updated 7 years ago
- ☆10May 24, 2020Updated 5 years ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆19Nov 28, 2025Updated 3 months ago
- Using image caption models to extract prompts in ComfyUI☆10May 21, 2025Updated 9 months ago
- Example project from my "Manipulating Embedded Lua VMs" series. Read more at: https://openpunk.com/pages/manipulating-lua-vms-1/☆11Apr 21, 2019Updated 6 years ago
- Self-contained block explorer for Stacks☆10Nov 11, 2024Updated last year
- Nodes to run Hunyuan Image 3 locally with BF16 and NF4 quantized options in Comfyui☆33Feb 21, 2026Updated 2 weeks ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆12Dec 24, 2025Updated 2 months ago
- DigiKam media files search by contained objects☆13Feb 23, 2022Updated 4 years ago
- One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Model☆26Nov 21, 2025Updated 3 months ago
- ☆13May 17, 2025Updated 9 months ago
- Collection of useful scripts for RunPod pods☆15Jan 26, 2024Updated 2 years ago
- easyanimate generate videos with ExLlamaV2 quantization LLM prompt☆13Jun 26, 2024Updated last year
- Search, download Vimeo videos and retrieve metadata in Go.☆11Feb 10, 2022Updated 4 years ago
- Easy Pony is a helper node that simplifies the process of adding scoring and other attributes to the core when prompting with Pony models…☆12Apr 5, 2025Updated 11 months ago
- A procedural macro to combine multiple configuration methods at compile time☆12Mar 29, 2023Updated 2 years ago
- ☆16Oct 26, 2025Updated 4 months ago
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated last year
- An advanced AI-powered tool that automatically translates and dubs YouTube videos into different languages while dynamically adjusting vi…☆14Nov 9, 2024Updated last year
- Learning semantic embeddings from OSM data: A Pytorch implementation of the loc2vec general method outlined in: https://sentiance.com/loc…☆11Oct 7, 2024Updated last year
- ☆13Dec 16, 2024Updated last year
- ☆13Mar 25, 2025Updated 11 months ago
- Scripts to apply patches to genymotion that improve usability, such as arm translation.☆17Jun 26, 2025Updated 8 months ago
- Command line tool to execute Synology "Universal Search" from within a shell☆12May 8, 2021Updated 4 years ago
- A toy text-to-image model trained from scratch.☆19Jun 9, 2025Updated 9 months ago
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆12Mar 6, 2025Updated last year