Real-time webcam demo with SmolVLM and llama.cpp server
☆5,531 · May 12, 2025 · Updated 9 months ago
Alternatives and similar repositories for smolvlm-realtime-webcam
Users who are interested in smolvlm-realtime-webcam are comparing it to the libraries listed below.
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025 ☆7,243 · May 5, 2025 · Updated 10 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆19,153 · Nov 19, 2025 · Updated 3 months ago
- LLM inference in C/C++ ☆97,252 · Updated this week
- Official inference framework for 1-bit LLMs ☆28,697 · Feb 3, 2026 · Updated last month
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆79,644 · Mar 3, 2026 · Updated last week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models. ☆99,935 · Mar 2, 2026 · Updated last week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste… ☆4,821 · Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆53,029 · Mar 3, 2026 · Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆61,332 · Mar 3, 2026 · Updated last week
- We write your reusable computer vision tools. 💜 ☆36,654 · Mar 3, 2026 · Updated last week
- Build, run, manage agentic software at scale. ☆38,516 · Updated this week
- Universal memory layer for AI Agents ☆48,604 · Updated this week
- Get your documents ready for gen AI ☆54,754 · Updated this week
- real time face swap and one-click video deepfake with only a single image ☆79,811 · Updated this week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data ☆89,344 · Updated this week
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications. ☆31,289 · Feb 26, 2026 · Updated last week
- FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app,… ☆129,077 · Updated this week
- Python tool for converting files and office documents to Markdown. ☆90,316 · Feb 20, 2026 · Updated 2 weeks ago
- An open-source RAG-based tool for chatting with your documents. ☆25,193 · Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆4,685 · Oct 27, 2025 · Updated 4 months ago
- 🪄 Create rich visualizations with AI ☆15,103 · Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆126,337 · Updated this week
- 🤗 smolagents: a barebones library for agents that think in code. ☆25,756 · Mar 1, 2026 · Updated last week
- A collection of MCP servers. ☆82,016 · Feb 26, 2026 · Updated last week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆58,756 · Updated this week
- 🙌 OpenHands: AI-Driven Development ☆68,459 · Mar 3, 2026 · Updated last week
- Build Real-Time Knowledge Graphs for AI Agents ☆23,438 · Updated this week
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0… ☆2,932 · Updated this week
- tiny vision language model ☆9,386 · Nov 14, 2025 · Updated 3 months ago
- The AI Browser Automation Framework ☆21,356 · Updated this week
- The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents. ☆9,377 · Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆37,994 · Updated this week
- A Conversational Speech Generation Model ☆14,530 · May 27, 2025 · Updated 9 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework" ☆8,630 · Oct 16, 2025 · Updated 4 months ago
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost … ☆25,444 · Mar 2, 2026 · Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆9,799 · Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval… ☆13,266 · Updated this week
- Lightweight coding agent that runs in your terminal ☆62,963 · Updated this week
- Kortix – build, manage and train AI Agents. ☆19,466 · Updated this week