The python library for real-time communication
☆4,535Jan 12, 2026Updated last month
Alternatives and similar repositories for fastrtc
Users that are interested in fastrtc are comparing it to the libraries listed below
Sorting:
- This tool has been deprecated. Use Agentic Document Extraction instead.☆5,257Jan 29, 2026Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training☆16,947Feb 19, 2026Updated last week
- A fast multimodal LLM for real-time voice☆4,367Dec 12, 2025Updated 2 months ago
- A framework for building realtime voice AI agents 🤖🎙️📹☆9,441Updated this week
- Build, run, manage agentic software at scale.☆38,276Updated this week
- Open Source framework for voice and multimodal conversational AI☆10,529Updated this week
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…☆9,502Jul 11, 2025Updated 7 months ago
- SOTA Open Source TTS☆25,078Feb 2, 2026Updated last month
- Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!☆8,901Updated this week
- 🚀 The fast, Pythonic way to build MCP servers and clients.☆23,221Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,750Feb 12, 2026Updated 2 weeks ago
- Universal memory layer for AI Agents☆47,994Feb 23, 2026Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆79,028Feb 26, 2026Updated last week
- Open-source framework for conversational voice AI agents☆10,094Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆53,029Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,406Sep 12, 2025Updated 5 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,949Sep 30, 2025Updated 5 months ago
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆23,942Feb 23, 2026Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- GenAI Agent Framework, the Pydantic way☆15,120Updated this week
- 🪄 Create rich visualizations with AI☆15,069Feb 24, 2026Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆60,971Feb 25, 2026Updated last week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆6,452Updated this week
- Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.☆23,280Oct 28, 2025Updated 4 months ago
- An open-source RAG-based tool for chatting with your documents.☆25,168Updated this week
- Build Real-Time Knowledge Graphs for AI Agents☆23,192Updated this week
- Get your documents ready for gen AI☆54,094Feb 24, 2026Updated last week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,706May 27, 2025Updated 9 months ago
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,803Feb 25, 2026Updated last week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆87,163Updated this week
- Fully local web research and report writing assistant☆8,515Feb 25, 2026Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,342Feb 21, 2025Updated last year
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org☆16,104Updated this week
- The AI Browser Automation Framework☆21,261Updated this week
- Spec-driven development for large codebases☆5,248Updated this week
- 🙌 OpenHands: AI-Driven Development☆68,459Updated this week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆4,478Feb 24, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,360Feb 24, 2026Updated last week
- Simple, unified interface to multiple Generative AI providers☆13,486Dec 15, 2025Updated 2 months ago