ngxson / smolvlm-realtime-webcam
Real-time webcam demo with SmolVLM and llama.cpp server
☆5,505 Updated 8 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users who are interested in smolvlm-realtime-webcam are comparing it to the repositories listed below
- Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source. ☆4,757 Updated this week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025 ☆7,180 Updated 8 months ago
- ☆1,343 Updated last week
- Lightweight coding agent that runs in your terminal ☆2,167 Updated 8 months ago
- AirLLM 70B inference with single 4GB GPU ☆1,908 Updated 4 months ago
- A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally. ☆14,992 Updated last week
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆7,090 Updated last month
- Kimi K2 is the large language model series developed by Moonshot AI team ☆9,888 Updated last week
- Towards Human-Sounding Speech ☆5,909 Updated last month
- computer vision and sports ☆4,839 Updated 2 months ago
- GenAI Processors is a lightweight Python library that enables efficient, parallel content processing. ☆2,050 Updated this week
- ☆2,269 Updated 2 months ago
- Python library for Agentic Document Extraction from LandingAI ☆2,327 Updated 2 weeks ago
- VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models ☆3,956 Updated 6 months ago
- A curated list of 100+ libraries and frameworks for AI engineers building with LLMs ☆2,416 Updated 2 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆786 Updated last week
- When Philosophy meets AI ☆1,436 Updated 3 months ago
- PersonaPlex code. ☆3,110 Updated this week
- A collection of 🤗 Transformers.js demos and example applications ☆1,935 Updated 2 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,435 Updated 9 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆19,064 Updated 2 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,657 Updated this week
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input. ☆1,959 Updated last month
- ☆2,103 Updated 10 months ago
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗 ☆696 Updated 6 months ago
- ☆867 Updated 3 months ago
- Implement a reasoning LLM in PyTorch from scratch, step by step ☆2,689 Updated this week
- AI agents can now use real Android and iOS apps, just like a human. ☆2,138 Updated this week
- ContextGem: Effortless LLM extraction from documents ☆1,762 Updated last month
- The most accurate document search and store for building AI apps ☆3,456 Updated last week