ngxson / smolvlm-realtime-webcam
Real-time webcam demo with SmolVLM and llama.cpp server
☆4,077 Updated 3 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users who are interested in smolvlm-realtime-webcam are comparing it to the repositories listed below.
- 100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more. ☆2,761 Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ☆3,774 Updated this week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025 ☆5,263 Updated 3 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ☆2,171 Updated last week
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms ☆1,899 Updated this week
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,255 Updated 3 months ago
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,214 Updated 3 months ago
- Towards Human-Sounding Speech ☆5,377 Updated 3 months ago
- Everything about the SmolLM and SmolVLM family of models ☆3,108 Updated last week
- ☆1,894 Updated 4 months ago
- The python library for real-time communication ☆4,190 Updated last week
- Cross-platform framework for deploying LLM/VLM/TTS models locally on smartphones. ☆2,696 Updated this week
- Everything you need to know to build your own RAG application ☆2,970 Updated 3 weeks ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆2,543 Updated last week
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation ☆4,148 Updated last month
- Tool for generating high quality Synthetic datasets ☆1,117 Updated last week
- LLM agents built for control. Designed for real-world use. Deployed in minutes. ☆3,436 Updated this week
- VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models ☆2,309 Updated 2 weeks ago
- A course on aligning smol models. ☆6,096 Updated last month
- ☆1,134 Updated last week
- A Python package that makes it easy for developers to create AI apps powered by various AI providers. ☆1,627 Updated 4 months ago
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗 ☆671 Updated last month
- Local realtime voice AI ☆2,347 Updated 5 months ago
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. ☆8,273 Updated last month
- ⚙️ Create and run workflows (RPA 2.0) ☆3,644 Updated this week
- ☆5,722 Updated 3 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆727 Updated 2 months ago
- GenAI Processors is a lightweight Python library that enables efficient, parallel content processing. ☆1,912 Updated last week
- Meet Ava, the WhatsApp Agent ☆1,462 Updated 3 months ago
- Document to Markdown OCR library with Llama 3.2 vision ☆2,375 Updated 6 months ago