Sanster / VLM-demosLinks
Collect VLM models that can be tried online.
☆14Updated last year
Alternatives and similar repositories for VLM-demos
Users that are interested in VLM-demos are comparing it to the libraries listed below
Sorting:
- Run Open Source Local AI Models in Excel with Ollama☆23Updated 2 months ago
 - ☆47Updated last year
 - Auto Thinking Mode switch for Qwen3 in Open webui☆68Updated 5 months ago
 - Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆80Updated this week
 - [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆107Updated last month
 - Prompt 工程师利器,可同时比较多个 Prompts 在多个 LLM 模型上的效果☆96Updated 2 years ago
 - qwen create prompt for sdxl☆34Updated last year
 - Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆194Updated 7 months ago
 - 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
 - Incredibly descriptive audiovisual summaries for videos☆40Updated last year
 - An AI agent to control drones from your CLI☆133Updated 2 months ago
 - ComfyUI wrapper for Moondream's gaze detection☆55Updated 9 months ago
 - Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆57Updated 5 months ago
 - XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
 - ComfyUI YOLO-World Integration☆48Updated last year
 - Official Repo For THE Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆22Updated 2 months ago
 - Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 8 months ago
 - Tencent Hunyuan 7B (short as Hunyuan-7B) is one of the large language dense models of Tencent Hunyuan☆66Updated 2 months ago
 - ☆24Updated last year
 - Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆14Updated last year
 - Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Updated last year
 - ☆43Updated 2 months ago
 - Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Updated 7 months ago
 - Stream live plots to a matplotlib figure☆80Updated 6 months ago
 - A diffusers pipeline for zero shot stylised couples portrait creation☆101Updated 10 months ago
 - ☆33Updated last year
 - Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 9 months ago
 - g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last year
 - ☆182Updated last month
 - Python SDK for Stable Diffusion API (Txt2Img/Img2Img/ControlNet/VAE)☆40Updated 2 years ago