Sanster / VLM-demosLinks
Collect VLM models that can be tried online.
☆14Updated last year
Alternatives and similar repositories for VLM-demos
Users that are interested in VLM-demos are comparing it to the libraries listed below
Sorting:
- ☆47Updated last year
- Run Open Source Local AI Models in Excel with Ollama☆23Updated 3 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆70Updated 6 months ago
- ComfyUI wrapper for Moondream's gaze detection☆55Updated 9 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆194Updated 8 months ago
- ☆71Updated last year
- Prompt 工程师利器,可同时比较多个 Prompts 在多个 LLM 模型上的效果☆97Updated 2 years ago
- Incredibly descriptive audiovisual summaries for videos☆40Updated last year
- Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆87Updated this week
- ☆44Updated 3 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆115Updated last week
- ImageSlider custom component for gradio.☆43Updated last year
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆70Updated last year
- ComfyUI YOLO-World Integration☆48Updated last year
- ☆185Updated 2 months ago
- ☆29Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- Python SDK for Stable Diffusion API (Txt2Img/Img2Img/ControlNet/VAE)☆40Updated 2 years ago
- ☆16Updated last year
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆22Updated 3 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆47Updated 7 months ago
- ☆50Updated 2 months ago
- ☆14Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated last year
- Official PixVerse Model Context Protocol (MCP) server that enables interaction with powerful AI video generation APIs.☆30Updated last month
- coze api to openai☆15Updated last year
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆57Updated 6 months ago
- A diffusers pipeline for zero shot stylised couples portrait creation☆101Updated 11 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆81Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year