Sanster / VLM-demosLinks
Collect VLM models that can be tried online.
☆13Updated last year
Alternatives and similar repositories for VLM-demos
Users that are interested in VLM-demos are comparing it to the libraries listed below
Sorting:
- ☆46Updated last year
- Auto Thinking Mode switch for Qwen3 in Open webui☆67Updated 3 months ago
- PresentAgent: Multimodal Agent for Presentation Video Generation☆91Updated last week
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆180Updated 5 months ago
- ComfyUI wrapper for Moondream's gaze detection☆55Updated 6 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆53Updated 2 months ago
- An AI agent to control drones from your CLI☆122Updated last week
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆27Updated this week
- Incredibly descriptive audiovisual summaries for videos☆41Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Updated 7 months ago
- qwen create prompt for sdxl☆33Updated last year
- ☆29Updated last year
- ComfyUI node for fast neural style transfer☆71Updated 4 months ago
- ☆71Updated last year
- I have successfully load the vision understanding fuction of the GLM4 in COMFYUI. Anyuser could use their own API-KEY to use this fuction☆29Updated last year
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated 10 months ago
- A diffusers pipeline for zero shot stylised couples portrait creation☆101Updated 7 months ago
- ☆26Updated last year
- ComfyUI YOLO-World Integration☆47Updated last year
- Awesome Code Action - DeepWebSearch AgentKit App. Build 🙌 with 🤗 Hugging Face smolagents framework☆105Updated 2 weeks ago
- ☆24Updated last year
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 5 months ago
- Stream live plots to a matplotlib figure☆79Updated 3 months ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆68Updated 9 months ago
- coze api to openai☆14Updated 11 months ago
- Community ComfyUI workflows running on fal.ai☆58Updated 11 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 10 months ago