pipecat-ai / pipecatLinks
Open Source framework for voice and multimodal conversational AI
β10,183Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below
Sorting:
- A powerful framework for building realtime voice AI agents π€ποΈπΉβ9,277Updated this week
- A fast multimodal LLM for real-time voiceβ4,349Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ4,416Updated this week
- π€ Build voice-based LLM agents. Modular + open source.β3,691Updated last year
- Converts text to speech in realtimeβ3,750Updated 3 weeks ago
- Voice activity detector (VAD) for the browser with a simple APIβ1,808Updated last week
- The python library for real-time communicationβ4,519Updated 3 weeks ago
- Local realtime voice AIβ2,425Updated 2 months ago
- React app for inspecting, building and debugging with the Realtime APIβ3,551Updated 5 months ago
- π A better UX for chat, writing content, and coding with LLMs.β5,343Updated last month
- Have a natural, spoken conversation with AI!β3,506Updated 6 months ago
- Build multi-agent systems that learn and improve with every interaction.β37,691Updated this week
- first base model for full-duplex conversational audioβ1,774Updated last year
- Zep | Examples, Integrations, & Moreβ4,048Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β3,119Updated 8 months ago
- β1,264Updated last week
- π§ Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 πβ5,091Updated this week
- Open-source Next.js template for building apps that are fully generated by AI. By E2B.β6,153Updated last month
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ3,128Updated 2 months ago
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simplβ¦β5,599Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising qualityβ4,581Updated last year
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcriβ¦β9,454Updated 6 months ago
- A Conversational Speech Generation Modelβ14,488Updated 8 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β9,557Updated 3 weeks ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,526Updated 2 months ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including Cβ¦β5,250Updated 3 months ago
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Coβ¦β5,996Updated 2 months ago
- Inference and training library for high-quality TTS models.β5,528Updated last year
- β2,906Updated this week
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ2,334Updated 8 months ago