ngxson / smolvlm-realtime-webcam
Real-time webcam demo with SmolVLM and llama.cpp server
☆5,214 Updated 7 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users who are interested in smolvlm-realtime-webcam are comparing it to the libraries listed below
- Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source. ☆4,741 Updated this week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025 ☆7,110 Updated 8 months ago
- ☆2,080 Updated 9 months ago
- A Python package that makes it easy for developers to create AI apps powered by various AI providers. ☆1,650 Updated 9 months ago
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval… ☆10,543 Updated last week
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more. ☆3,937 Updated this week
- WhatsApp MCP server ☆5,184 Updated 5 months ago
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗 ☆692 Updated 5 months ago
- Document to Markdown OCR library with Llama 3.2 vision ☆2,416 Updated 11 months ago
- ☆531 Updated 7 months ago
- RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tun… ☆5,047 Updated last month
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. ☆1,156 Updated 11 months ago
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms ☆2,199 Updated 2 weeks ago
- OCR model that handles complex tables, forms, handwriting with full layout. ☆4,260 Updated 3 weeks ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆3,181 Updated this week
- ☆6,280 Updated 4 months ago
- ☆2,749 Updated 8 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆785 Updated last week
- computer vision and sports ☆4,813 Updated 2 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,351 Updated 8 months ago
- Kernels & AI inference engine for mobile devices. ☆3,998 Updated this week
- ☆1,328 Updated last month
- 🖥️ Run AI Agent in your browser. ☆15,412 Updated 4 months ago
- ✨ Build a machine learning model from a prompt ☆2,287 Updated 4 months ago
- Lightweight coding agent that runs in your terminal ☆2,161 Updated 8 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,410 Updated 8 months ago
- Have a natural, spoken conversation with AI! ☆3,448 Updated 5 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆19,014 Updated last month
- The most accurate document search and store for building AI apps ☆3,441 Updated this week
- ContextGem: Effortless LLM extraction from documents ☆1,750 Updated 3 weeks ago