Sanster / VLM-demos
A collection of VLM models that can be tried online.
☆13Updated last year
Alternatives and similar repositories for VLM-demos
Users who are interested in VLM-demos are comparing it to the libraries listed below.
- Auto Thinking Mode switch for Qwen3 in Open webui☆65Updated last month
- ☆39Updated 3 months ago
- ☆46Updated last year
- Animated optical illusions in ComfyUI☆21Updated last year
- ☆13Updated 10 months ago
- ComfyUI wrapper for Moondream's gaze detection☆53Updated 4 months ago
- Fooocus with support for the Taiyi-Diffusion-XL model☆20Updated last year
- Diffusers Image Fill v3 -- Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+…☆12Updated 7 months ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated last week
- Fine-tune of Florence-2 for shot categorization.☆24Updated 3 months ago
- ☆29Updated last year
- A voice assistant that runs completely on your local device.☆19Updated last week
- I have successfully loaded the vision understanding function of GLM4 in ComfyUI. Any user can use their own API key to use this function☆26Updated last year
- Official PixVerse Model Context Protocol (MCP) server that enables interaction with powerful AI video generation APIs.☆23Updated last month
- To try textin document parsing, visit https://cc.co/16YSIy ☆22Updated 11 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 4 months ago
- ComfyUI node for fast neural style transfer☆71Updated 2 months ago
- Incredibly descriptive audiovisual summaries for videos☆41Updated 10 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆173Updated 3 months ago
- 02. Enabling various applications to be AI-enabled or used by AI.☆28Updated 9 months ago
- ☆12Updated last year
- A minimalistic, hackable code base to finetune Wan video generation model☆40Updated 2 months ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- Developer-friendly inference code for RVC-Boss/GPT-SoVITS.☆15Updated 8 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 8 months ago
- Visualization of 2400+ nodes | Collection of ComfyUI Custom Nodes☆25Updated last year
- Modern Stable Diffusion models family - Fluently☆32Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆47Updated 6 months ago
- Rough LLM Interpreter of ComfyUI☆24Updated 5 months ago
- ☆24Updated last year