pipecat-ai / pipecatLinks
Open Source framework for voice and multimodal conversational AI
β6,748Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below
Sorting:
- A powerful framework for building realtime voice AI agents π€ποΈπΉβ6,687Updated this week
- A fast multimodal LLM for real-time voiceβ4,087Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,782Updated last month
- Local realtime voice AIβ2,330Updated 4 months ago
- π§ Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 πβ4,095Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β2,949Updated last month
- The python library for real-time communicationβ4,115Updated this week
- first base model for full-duplex conversational audioβ1,746Updated 6 months ago
- A language model programming library.β5,798Updated last month
- Converts text to speech in realtimeβ3,268Updated 2 weeks ago
- Inference and training library for high-quality TTS models.β5,336Updated 7 months ago
- React app for inspecting, building and debugging with the Realtime APIβ3,326Updated 2 weeks ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ4,100Updated 2 months ago
- The easiest way to use Agentic RAG in any enterpriseβ4,282Updated 5 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β8,592Updated last week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitchingβ3,197Updated last week
- π€ Build voice-based LLM agents. Modular + open source.β3,373Updated 7 months ago
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simplβ¦β5,027Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,616Updated 11 months ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,195Updated 2 weeks ago
- Zep | Examples, Integrations, & Moreβ3,344Updated last week
- Deploy your agentic worfklows to productionβ2,035Updated last week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,828Updated 11 months ago
- The Open Source Memory Layer For Autonomous Agentsβ2,275Updated 8 months ago
- β782Updated 2 months ago
- π A better UX for chat, writing content, and coding with LLMs.β4,759Updated last month
- A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech modeβ¦β1,029Updated last month
- ML-powered speech recognition directly in your browserβ2,987Updated 9 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ4,137Updated 4 months ago
- Agent Framework / shim to use Pydantic with LLMsβ10,703Updated this week