NVIDIA-AI-IOT / live-vlm-webuiLinks
Real-time Vision Language Model interaction via webcam - WebRTC-based web interface
☆184Updated 3 weeks ago
Alternatives and similar repositories for live-vlm-webui
Users that are interested in live-vlm-webui are comparing it to the libraries listed below
Sorting:
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆125Updated 3 weeks ago
- ☆178Updated 4 months ago
- Service for testing out the new Qwen2.5 omni model☆61Updated 8 months ago
- your private, personal assistant☆59Updated 3 months ago
- An AI assistant building SDK in python☆39Updated 3 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆100Updated 6 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆338Updated last year
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆282Updated this week
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆138Updated 2 months ago
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆235Updated 4 months ago
- ☆195Updated 9 months ago
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.☆39Updated this week
- ☆430Updated last month
- ☆19Updated 6 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Updated 3 weeks ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆135Updated last year
- ☆191Updated last month
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆49Updated 4 months ago
- A cross platform App that gives you the best UX to run models locally or remotely on your own hardware☆70Updated 2 weeks ago
- ☆93Updated 3 months ago
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes.☆242Updated last month
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆225Updated 5 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆49Updated 3 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆126Updated 4 months ago
- Personal voice assistant, with voice interruption and Twilio support☆18Updated 10 months ago
- Inference service for Qwen2.5-VL-7b model☆208Updated 9 months ago
- Fast local speech-to-text for any app using faster-whisper☆145Updated 3 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆246Updated 11 months ago
- Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model reco…☆224Updated 5 months ago
- ☆51Updated 3 months ago