NVIDIA-AI-IOT / live-vlm-webuiLinks
Real-time Vision Language Model interaction via webcam - WebRTC-based web interface
☆141Updated 2 weeks ago
Alternatives and similar repositories for live-vlm-webui
Users that are interested in live-vlm-webui are comparing it to the libraries listed below
Sorting:
- Service for testing out the new Qwen2.5 omni model☆62Updated 7 months ago
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆161Updated 3 months ago
- your private, personal assistant☆58Updated 2 months ago
- ☆176Updated 3 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆119Updated 2 weeks ago
- Port of Facebook's LLaMA model in C/C++☆64Updated 7 months ago
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆53Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 7 months ago
- Use smol agents to do research and then update csv coumns with its findings.☆41Updated 10 months ago
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes.☆217Updated 2 weeks ago
- ☆19Updated 5 months ago
- ☆180Updated this week
- A real-time shared memory layer for multi-agent LLM systems.☆50Updated 5 months ago
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆137Updated last month
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆99Updated 5 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆278Updated 3 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆136Updated last year
- automatically quant GGUF models☆219Updated last month
- Personal voice assistant, with voice interruption and Twilio support☆18Updated 9 months ago
- ☆83Updated 9 months ago
- Fast local speech-to-text for any app using faster-whisper☆144Updated 2 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆125Updated 3 months ago
- ☆20Updated last year
- LLM Fine Tuning Toolbox images for Ryzen AI 395+ Strix Halo☆36Updated 2 months ago
- Reachy Mini's SDK☆301Updated this week
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Updated last week
- An AI assistant building framework in python☆37Updated 2 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51Updated 6 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆335Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆48Updated 2 months ago