ertugrul-dmr / qwen2vl-captioner-guiView external linksLinks
☆21Sep 28, 2024Updated last year
Alternatives and similar repositories for qwen2vl-captioner-gui
Users that are interested in qwen2vl-captioner-gui are comparing it to the libraries listed below
Sorting:
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆25Feb 21, 2025Updated 11 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated last year
- ☆22Dec 23, 2025Updated last month
- ☆22Dec 11, 2025Updated 2 months ago
- Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features☆13Updated this week
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆11Dec 7, 2025Updated 2 months ago
- ☆24Jun 19, 2025Updated 7 months ago
- Various training scripts used to train bigasp☆111Aug 13, 2025Updated 6 months ago
- [AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆64Jan 27, 2026Updated 3 weeks ago
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated 11 months ago
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆12Mar 6, 2025Updated 11 months ago
- ☆10May 24, 2020Updated 5 years ago
- A procedural macro to combine multiple configuration methods at compile time☆12Mar 29, 2023Updated 2 years ago
- Search, download Vimeo videos and retrieve metadata in Go.☆11Feb 10, 2022Updated 4 years ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆19Nov 28, 2025Updated 2 months ago
- Revised use and access to weapon tints for qb-core - XP sink for mz-skills☆12Feb 20, 2023Updated 2 years ago
- Remote sensing labwork☆12Feb 27, 2018Updated 7 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆11Sep 30, 2021Updated 4 years ago
- ☆16Oct 26, 2025Updated 3 months ago
- A toy text-to-image model trained from scratch.☆19Jun 9, 2025Updated 8 months ago
- Latent Editing Nodes for Comfyui☆31Aug 13, 2025Updated 6 months ago
- Example project from my "Manipulating Embedded Lua VMs" series. Read more at: https://openpunk.com/pages/manipulating-lua-vms-1/☆11Apr 21, 2019Updated 6 years ago
- ☆17Nov 6, 2025Updated 3 months ago
- ☆17Apr 25, 2025Updated 9 months ago
- ☆20Jul 24, 2025Updated 6 months ago
- Official Implementation of implicit reference attack☆11Oct 16, 2024Updated last year
- Using image caption models to extract prompts in ComfyUI☆10May 21, 2025Updated 8 months ago
- A repository containing prebuilt versions of the bitsandbytes library for Windows☆40Apr 6, 2023Updated 2 years ago
- A batch captioning tool for joy_caption☆196Aug 25, 2025Updated 5 months ago
- Extract individual frames from a video as png images (android)☆13Dec 30, 2022Updated 3 years ago
- ☆11Jan 8, 2025Updated last year
- Official repository for the paper "Scaling Painting Style Transfer"☆10Jul 1, 2024Updated last year
- Running x86_64 applications on Android☆12Oct 29, 2023Updated 2 years ago
- Official implementation of Humans as Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos (IC…☆15Oct 21, 2025Updated 3 months ago
- Expose a server running on your local machine to the internet, like Ngrok, based on Netty☆14Jun 1, 2021Updated 4 years ago
- Lightweight piece tokenization library☆12Apr 15, 2024Updated last year
- Yoga-82 dataset evaluation☆10Sep 11, 2020Updated 5 years ago