Sanster / VLM-demos
A collection of VLM models that can be tried online.
☆13Updated last year
Alternatives and similar repositories for VLM-demos
Users who are interested in VLM-demos are comparing it to the repositories listed below.
- Auto Thinking Mode switch for Qwen3 in Open WebUI☆61Updated 3 weeks ago
- ☆46Updated last year
- A minimalistic, hackable code base to finetune Wan video generation model☆39Updated last month
- Diffusers Image Fill v3 -- Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+…☆12Updated 6 months ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆14Updated last year
- Awesome Code Action - DeepWebSearch AgentKit App. Build with 🤗 Hugging Face smolagents framework☆40Updated this week
- ComfyUI wrapper for Moondream's gaze detection☆53Updated 4 months ago
- ☆32Updated 4 months ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated 3 weeks ago
- Coze API to OpenAI☆14Updated 9 months ago
- A voice assistant that runs completely on your local device.☆19Updated 2 weeks ago
- To try TextIn document parsing, click https://cc.co/16YSIy☆22Updated 10 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆22Updated last week
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.☆12Updated 10 months ago
- Fine-tune of Florence-2 for shot categorization.☆24Updated 3 months ago
- ☆13Updated 9 months ago
- Developer-friendly inference code for RVC-Boss/GPT-SoVITS.☆15Updated 8 months ago
- ☆29Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated 3 weeks ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated 8 months ago
- Qwen-generated prompts for SDXL☆32Updated last year
- ☆18Updated 4 months ago
- 🧠 Web AI / LLM in browser / Whisper in browser / WebGPU inference Examples☆20Updated 2 weeks ago
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Updated last year
- Open source intent recognition framework powered by LLMs.☆19Updated 5 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- Modern Stable Diffusion models family - Fluently☆31Updated 11 months ago
- ☆16Updated last year