ngxson / smolvlm-realtime-webcamLinks
Real-time webcam demo with SmolVLM and llama.cpp server
☆4,745Updated 4 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users that are interested in smolvlm-realtime-webcam are comparing it to the libraries listed below
Sorting:
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆4,321Updated this week
- 100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more.☆3,665Updated this week
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,647Updated 5 months ago
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆2,132Updated last week
- ☆1,244Updated this week
- A react-based starter app for using the Live API over websockets with Gemini☆2,336Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,264Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆18,429Updated 2 months ago
- computer vision and sports☆4,609Updated last month
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,275Updated 5 months ago
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025☆6,583Updated 4 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,377Updated last week
- ☆2,031Updated 6 months ago
- A mini, open-weights, version of our Proxy assistant.☆963Updated 6 months ago
- Lightweight coding agent that runs in your terminal☆2,043Updated 4 months ago
- Render any git repo into a single static HTML page for humans or LLMs☆1,713Updated last month
- RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.☆3,021Updated last week
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆681Updated 2 months ago
- ☆829Updated 4 months ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆9,697Updated this week
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆741Updated 3 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,692Updated last week
- SOTA search powered LLM☆3,640Updated 5 months ago
- A powerful coding assistant application that integrates with the DeepSeek API to process user conversations and generate structured JSON …☆2,250Updated 3 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,047Updated last week
- An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.☆13,095Updated this week
- A collection of guides and examples for the Gemma open models from Google.☆2,201Updated this week
- Document to Markdown OCR library with Llama 3.2 vision☆2,406Updated 8 months ago
- Exa is a Web Search API | This is Exa MCP (Model Context Protocol)☆2,251Updated this week
- Make Mac apps accessible for AI agents☆1,601Updated 6 months ago