ngxson / smolvlm-realtime-webcamLinks
Real-time webcam demo with SmolVLM and llama.cpp server
☆4,031Updated 2 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users that are interested in smolvlm-realtime-webcam are comparing it to the libraries listed below
Sorting:
- 100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more.☆2,634Updated this week
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆1,851Updated last week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025☆4,330Updated 2 months ago
- Sim Studio is an open-source AI agent workflow builder. Sim Studio's interface is a lightweight, intuitive way to quickly build and deplo…☆4,893Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆3,593Updated last week
- ☆693Updated last week
- ☆3,454Updated 3 months ago
- Lightweight coding agent that runs in your terminal☆1,903Updated 2 months ago
- SoTA open-source TTS☆9,357Updated last month
- ☆424Updated last month
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆1,884Updated last week
- A mini, open-weights, version of our Proxy assistant.☆944Updated 4 months ago
- Natural Language Web☆5,573Updated this week
- This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark …☆8,564Updated this week
- mcp-use is the easiest way to interact with mcp servers with custom agents☆4,352Updated this week
- Cross-platform framework for deploying LLM/VLM/TTS models locally on smartphones.☆2,113Updated this week
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.☆8,193Updated 2 weeks ago
- II-Agent: a new open-source framework to build and deploy intelligent agents☆2,664Updated this week
- Towards Human-Sounding Speech☆5,229Updated 2 months ago
- WhatsApp MCP server☆4,475Updated this week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,482Updated this week
- The official ElevenLabs MCP server☆828Updated last week
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆679Updated last week
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation☆3,962Updated 3 weeks ago
- AI-powered multi-agent builder☆3,387Updated this week
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!☆6,595Updated this week
- A collection of 🤗 Transformers.js demos and example applications☆1,668Updated last month
- Riona Ai Agent 🌸 is built using Node.js and TypeScript 🛠️, designed for seamless job execution 📸. It's lightweight, efficient, and sti…☆3,488Updated this week
- ☆684Updated last week
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆2,665Updated last week