gradio-app / fastrtc
The Python library for real-time communication
☆4,115 · Updated last week
Alternatives and similar repositories for fastrtc
Users who are interested in fastrtc are comparing it to the libraries listed below.
- A fast multimodal LLM for real-time voice · ☆4,087 · Updated last week
- A powerful framework for building realtime voice AI agents · ☆6,687 · Updated last week
- Towards Human-Sounding Speech · ☆5,196 · Updated 2 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… · ☆2,470 · Updated last week
- Memory for AI Agents in 5 lines of code · ☆6,336 · Updated this week
- File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. · ☆6,560 · Updated 4 months ago
- Vision agent · ☆4,934 · Updated last week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent… · ☆2,701 · Updated last week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations · ☆6,238 · Updated 3 weeks ago
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … · ☆3,593 · Updated this week
- Build Real-Time Knowledge Graphs for AI Agents · ☆12,727 · Updated this week
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation · ☆3,962 · Updated 3 weeks ago
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth! · ☆6,502 · Updated this week
- Agent S: an open agentic framework that uses computers like a human · ☆5,748 · Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devices · ☆2,793 · Updated 2 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o · ☆4,100 · Updated 3 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… · ☆8,636 · Updated this week
- LLM agents built for control. Designed for real-world use. Deployed in minutes. · ☆3,266 · Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework · ☆3,383 · Updated this week
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML. · ☆3,382 · Updated this week
- Local realtime voice AI · ☆2,336 · Updated 4 months ago
- Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit · ☆3,295 · Updated last week
- Fully local web research and report writing assistant · ☆7,796 · Updated 2 weeks ago
- Open Source framework for voice and multimodal conversational AI · ☆6,805 · Updated this week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents · ☆5,445 · Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training · ☆13,196 · Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster · ☆5,287 · Updated last week
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri… · ☆8,085 · Updated this week
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co… · ☆4,032 · Updated 4 months ago
- https://hf.co/hexgrad/Kokoro-82M · ☆3,577 · Updated last week