NVIDIA-AI-IOT / live-vlm-webuiLinks
Real-time Vision Language Model interaction via webcam - WebRTC-based web interface
☆204Updated last month
Alternatives and similar repositories for live-vlm-webui
Users that are interested in live-vlm-webui are comparing it to the libraries listed below
Sorting:
- Service for testing out the new Qwen2.5 omni model☆62Updated 9 months ago
- Port of Facebook's LLaMA model in C/C++☆67Updated 9 months ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆50Updated 5 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆347Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆51Updated 4 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆134Updated last year
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆131Updated 2 weeks ago
- Inference service for Qwen2.5-VL-7b model☆209Updated 10 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆129Updated 2 years ago
- ☆178Updated 5 months ago
- Fast local speech-to-text for any app using faster-whisper☆145Updated 4 months ago
- A reference application for a local AI assistant with LLM and RAG☆117Updated last year
- ☆439Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆245Updated last year
- ☆46Updated 2 weeks ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆64Updated 8 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Updated 10 months ago
- ☆19Updated 6 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Updated last month
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆54Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 8 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆75Updated last year
- your private, personal assistant☆61Updated 4 months ago
- A web application that converts speech to speech 100% private☆82Updated 7 months ago
- ☆83Updated 11 months ago
- A real-time shared memory layer for multi-agent LLM systems.☆53Updated 2 weeks ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆103Updated 7 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆127Updated 4 months ago
- Running a LLM on the ESP32☆87Updated last year
- Personal voice assistant, with voice interruption and Twilio support☆18Updated 11 months ago