gradio-app/fastrtc

The python library for real-time communication

☆4,616

Alternatives and similar repositories for fastrtc

Users who are interested in fastrtc are comparing it to the libraries listed below.

landing-ai / vision-agent
This tool has been deprecated. Use Agentic Document Extraction instead.
☆5,287 · Jan 29, 2026 · Updated 5 months ago
allenai / olmocr
Toolkit for linearizing PDFs for LLM datasets/training
☆19,160 · Mar 25, 2026 · Updated 3 months ago
fixie-ai / ultravox
A fast multimodal LLM for real-time voice
☆4,478 · Dec 12, 2025 · Updated 7 months ago
livekit / agents
A framework for building realtime voice AI agents 🤖🎙️📹
☆11,471 · Updated this week
agno-agi / agno
Build, run, and manage agent platforms.
☆41,355 · Updated this week
pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
☆13,643 · Updated this week
kyutai-labs / moshi
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,657 · May 16, 2026 · Updated 2 months ago
fishaudio / fish-speech
SOTA Open Source TTS
☆31,354 · Jun 9, 2026 · Updated last month
TEN-framework / ten-framework
Open-source framework for conversational voice AI agents
☆10,942 · Updated this week
KoljaB / RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…
☆10,002 · Jun 12, 2026 · Updated last month
PrefectHQ / fastmcp
🚀 The fast, Pythonic way to build MCP servers and clients.
☆26,765 · Updated this week
microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
☆25,181 · Updated this week
oumi-ai / oumi
Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!
☆9,363 · Updated this week
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,666 · Updated this week
sofdog-gh / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗
☆701 · Jul 10, 2025 · Updated last year
microsoft / data-formulator
🪄 Data Formulator is an interactive AI-powered data analysis system makes it easy to connect, explore and visualize data.
☆15,979 · Updated this week
stanford-oval / storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
☆30,233 · Sep 30, 2025 · Updated 9 months ago
OpenBMB / MiniCPM-V
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
☆25,965 · Jun 25, 2026 · Updated 3 weeks ago
sinaptik-ai / pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
☆23,661 · Oct 28, 2025 · Updated 8 months ago
getzep / graphiti
Build Real-Time Knowledge Graphs for AI Agents
☆29,048 · Updated this week
huggingface / speech-to-speech
Build local voice agents with open-source models
☆6,279 · Updated this week
mem0ai / mem0
Universal memory layer for AI Agents
☆61,383 · Updated this week
emcie-co / parlant
Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictab…
☆18,181 · Jul 12, 2026 · Updated last week
Zipstack / unstract
LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows
☆6,714 · Updated this week
Cinnamon / kotaemon
An open-source RAG-based tool for chatting with your documents.
☆25,576 · Jul 14, 2026 · Updated last week
browser-use / browser-use
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
☆106,085 · Updated this week
tadata-org / fastapi_mcp
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!
☆11,951 · Nov 24, 2025 · Updated 7 months ago
open-mmlab / Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,959 · Mar 25, 2026 · Updated 3 months ago
Lightning-AI / LitServe
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
☆3,920 · Updated this week
browserbase / stagehand
The SDK For Browser Agents
☆23,582 · Updated this week
canopyai / Orpheus-TTS
Towards Human-Sounding Speech
☆6,256 · Dec 5, 2025 · Updated 7 months ago
langchain-ai / local-deep-researcher
Fully local web research and report writing assistant
☆9,281 · Jul 14, 2026 · Updated last week
unclecode / crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
☆74,132 · Updated this week
camel-ai / camel
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
☆17,464 · Updated this week
QuivrHQ / MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
☆7,403 · Feb 21, 2025 · Updated last year
pydantic / pydantic-ai
AI Agent Framework, the Pydantic way
☆18,734 · Updated this week
BerriAI / litellm
The fastest, litest AI Gateway. Rust core with Python SDK. Call 100+ LLM APIs in OpenAI (or native) format with cost tracking, guardrails…
☆54,241 · Updated this week
docling-project / docling
Get your documents ready for gen AI
☆63,616 · Updated this week
datalab-to / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,135 · Updated this week