ngxson / smolvlm-realtime-webcamView external linksLinks
Real-time webcam demo with SmolVLM and llama.cpp server
☆5,524May 12, 2025Updated 9 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users that are interested in smolvlm-realtime-webcam are comparing it to the libraries listed below
Sorting:
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025☆7,198May 5, 2025Updated 9 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,109Nov 19, 2025Updated 2 months ago
- LLM inference in C/C++☆94,823Updated this week
- Official inference framework for 1-bit LLMs☆28,054Feb 3, 2026Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆78,295Updated this week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.☆92,709Feb 4, 2026Updated last week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,803Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆59,947Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆51,922Updated this week
- Build multi-agent systems that learn and improve with every interaction.☆37,691Updated this week
- We write your reusable computer vision tools. 💜☆36,478Updated this week
- Universal memory layer for AI Agents☆47,230Feb 3, 2026Updated last week
- Get your documents ready for gen AI☆52,799Updated this week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆80,940Updated this week
- real time face swap and one-click video deepfake with only a single image☆79,430Updated this week
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications.☆28,027Jan 30, 2026Updated 2 weeks ago
- FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app,…☆113,705Feb 6, 2026Updated last week
- An open-source RAG-based tool for chatting with your documents.☆25,019Jul 4, 2025Updated 7 months ago
- Python tool for converting files and office documents to Markdown.☆86,605Jan 8, 2026Updated last month
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,647Oct 27, 2025Updated 3 months ago
- 🪄 Create rich visualizations with AI☆14,829Updated this week
- A collection of MCP servers.☆80,690Feb 1, 2026Updated 2 weeks ago
- 🤗 smolagents: a barebones library for agents that think in code.☆25,422Jan 23, 2026Updated 3 weeks ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆123,582Updated this week
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0…☆2,409Updated this week
- Build Real-Time Knowledge Graphs for AI Agents☆22,690Updated this week
- 🙌 OpenHands: AI-Driven Development☆67,779Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆57,756Updated this week
- tiny vision language model☆9,329Nov 14, 2025Updated 3 months ago
- mcp-use is the easiest way to interact with mcp servers with custom agents☆9,151Updated this week
- The AI Browser Automation Framework☆21,077Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆35,968Updated this week
- A Conversational Speech Generation Model☆14,488May 27, 2025Updated 8 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆8,540Oct 16, 2025Updated 3 months ago
- Lightweight coding agent that runs in your terminal☆60,194Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval…☆13,091Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,593Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost …☆24,939Nov 15, 2025Updated 3 months ago
- Kernels & AI inference engine for mobile devices.☆4,267Updated this week