ngxson / smolvlm-realtime-webcam
Real-time webcam demo with SmolVLM and llama.cpp server
☆4,840 Updated 7 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users who are interested in smolvlm-realtime-webcam are comparing it to the repositories listed below
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms ☆2,185 Updated last week
- A Python package that makes it easy for developers to create AI apps powered by various AI providers. ☆1,654 Updated 8 months ago
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,404 Updated 7 months ago
- Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source. ☆4,593 Updated this week
- RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tun… ☆4,713 Updated last month
- ☆2,074 Updated 9 months ago
- ☆1,320 Updated 3 weeks ago
- ☆840 Updated 7 months ago
- Meet Ava, the WhatsApp Agent ☆1,564 Updated last month
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more. ☆3,894 Updated this week
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. ☆10,492 Updated 2 months ago
- Python library for Agentic Document Extraction from LandingAI ☆2,301 Updated 3 weeks ago
- 🎨 Turn your roughest sketches into stunning 3D worlds by vibe drawing ☆1,970 Updated 5 months ago
- A course on aligning smol models. ☆6,541 Updated last month
- A mini, open-weights, version of our Proxy assistant. ☆977 Updated 9 months ago
- GenAI Processors is a lightweight Python library that enables efficient, parallel content processing. ☆2,015 Updated this week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework" ☆8,271 Updated 2 months ago
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗 ☆691 Updated 5 months ago
- mcp-use is the easiest way to interact with mcp servers with custom agents ☆8,573 Updated this week
- ⚙️ Create and run workflows (RPA 2.0) ☆3,826 Updated last week
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,343 Updated 8 months ago
- Everything about the SmolLM and SmolVLM family of models ☆3,448 Updated 3 weeks ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. ☆7,998 Updated this week
- Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral] ☆1,440 Updated 4 months ago
- Python package and backend for the Elysia platform app. ☆1,835 Updated last week
- The Python library for real-time communication ☆4,451 Updated 3 weeks ago
- Web-based tool converts GitHub repository contents into a single formatted text file ☆1,617 Updated 4 months ago
- ☆2,138 Updated last month
- SOTA search powered LLM ☆3,737 Updated 8 months ago
- ContextGem: Effortless LLM extraction from documents ☆1,744 Updated last month