ngxson / smolvlm-realtime-webcam
Real-time webcam demo with SmolVLM and llama.cpp server
☆4,808 Updated 5 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users who are interested in smolvlm-realtime-webcam are comparing it to the repositories listed below.
 - A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms ☆2,153 Updated this week
 - This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025 ☆6,830 Updated 5 months ago
 - Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ☆4,475 Updated this week
 - Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. ☆10,213 Updated 3 weeks ago
 - 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more. ☆3,778 Updated last week
 - The most accurate document search and store for building AI apps ☆3,339 Updated this week
 - A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆16,708 Updated this week
 - SoTA open-source TTS ☆14,370 Updated last month
 - Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗 ☆687 Updated 3 months ago
 - Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,346 Updated 6 months ago
 - GenAI Processors is a lightweight Python library that enables efficient, parallel content processing. ☆1,981 Updated last week
 - A react-based starter app for using the Live API over websockets with Gemini ☆2,373 Updated 2 weeks ago
 - RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tun… ☆3,868 Updated 2 weeks ago
 - Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆758 Updated 5 months ago
 - ☆1,288 Updated 3 weeks ago
 - ☆835 Updated 5 months ago
 - Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,306 Updated 6 months ago
 - The python library for real-time communication ☆4,373 Updated last month
 - ☆2,054 Updated 7 months ago
 - A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆2,760 Updated last week
 - A Python package that makes it easy for developers to create AI apps powered by various AI providers. ☆1,650 Updated 6 months ago
 - Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ☆2,543 Updated last month
 - On-device TTS model by Neuphonic ☆3,759 Updated this week
 - ✨ Build a machine learning model from a prompt ☆2,201 Updated 2 months ago
 - streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,641 Updated last week
 - 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ☆1,098 Updated this week
 - ☆1,339 Updated 6 months ago
 - 🪄 Create rich visualizations with AI ☆14,041 Updated this week
 - Python package and backend for the Elysia platform app. ☆1,785 Updated this week
 - Web-based tool converts GitHub repository contents into a single formatted text file ☆1,587 Updated 2 months ago