ngxson / smolvlm-realtime-webcamView external linksLinks
Real-time webcam demo with SmolVLM and llama.cpp server
☆5,524May 12, 2025Updated 9 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users that are interested in smolvlm-realtime-webcam are comparing it to the libraries listed below
Sorting:
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025☆7,198May 5, 2025Updated 9 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,109Nov 19, 2025Updated 2 months ago
- LLM inference in C/C++☆95,169Updated this week
- Official inference framework for 1-bit LLMs☆28,443Feb 3, 2026Updated 2 weeks ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆78,295Updated this week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.☆95,044Updated this week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,803Feb 10, 2026Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆59,947Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆51,922Updated this week
- We write your reusable computer vision tools. 💜☆36,491Updated this week
- The programming language for agentic software.☆37,873Updated this week
- Universal memory layer for AI Agents☆47,230Feb 3, 2026Updated 2 weeks ago
- Get your documents ready for gen AI☆52,799Updated this week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆83,148Updated this week
- real time face swap and one-click video deepfake with only a single image☆79,430Updated this week
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications.☆29,595Jan 30, 2026Updated 2 weeks ago
- FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app,…☆114,564Feb 6, 2026Updated last week
- An open-source RAG-based tool for chatting with your documents.☆25,095Jul 4, 2025Updated 7 months ago
- Python tool for converting files and office documents to Markdown.☆87,138Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,647Oct 27, 2025Updated 3 months ago
- 🪄 Create rich visualizations with AI☆14,829Updated this week
- A collection of MCP servers.☆80,690Feb 1, 2026Updated 2 weeks ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆123,582Updated this week
- 🤗 smolagents: a barebones library for agents that think in code.☆25,422Jan 23, 2026Updated 3 weeks ago
- 🙌 OpenHands: AI-Driven Development☆67,779Updated this week
- Build Real-Time Knowledge Graphs for AI Agents☆22,841Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆58,056Updated this week
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0…☆2,409Updated this week
- tiny vision language model☆9,329Nov 14, 2025Updated 3 months ago
- mcp-use is the easiest way to interact with mcp servers with custom agents☆9,151Updated this week
- The AI Browser Automation Framework☆21,077Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆35,968Updated this week
- A Conversational Speech Generation Model☆14,491May 27, 2025Updated 8 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆8,571Oct 16, 2025Updated 4 months ago
- Lightweight coding agent that runs in your terminal☆60,194Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval…☆13,091Feb 10, 2026Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,642Updated this week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost …☆24,982Nov 15, 2025Updated 3 months ago
- Kernels & AI inference engine for mobile devices.☆4,267Updated this week