pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
★10,078 · Updated this week
Alternatives and similar repositories for pipecat
Users who are interested in pipecat are comparing it to the libraries listed below.
- A powerful framework for building realtime voice AI agents ★9,214 · Updated this week
- A fast multimodal LLM for real-time voice ★4,334 · Updated last month
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ★9,463 · Updated 2 weeks ago
- Towards Human-Sounding Speech ★5,918 · Updated last month
- Fast and accurate automatic speech recognition (ASR) for edge devices ★3,115 · Updated 2 months ago
- Local realtime voice AI ★2,424 · Updated 2 months ago
- Converts text to speech in realtime ★3,740 · Updated 3 weeks ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ★4,278 · Updated 9 months ago
- The python library for real-time communication ★4,500 · Updated 3 weeks ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ★3,118 · Updated 8 months ago
- Have a natural, spoken conversation with AI! ★3,486 · Updated 6 months ago
- Build voice-based LLM agents. Modular + open source. ★3,687 · Updated last year
- ★1,249 · Updated this week
- https://hf.co/hexgrad/Kokoro-82M ★5,506 · Updated 5 months ago
- Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 ★5,062 · Updated this week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching ★4,359 · Updated 3 weeks ago
- React app for inspecting, building and debugging with the Realtime API ★3,548 · Updated 5 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ★19,077 · Updated 2 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ★2,822 · Updated last week
- first base model for full-duplex conversational audio ★1,773 · Updated last year
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co… ★5,942 · Updated last month
- A Conversational Speech Generation Model ★14,476 · Updated 8 months ago
- Inference and training library for high-quality TTS models. ★5,513 · Updated last year
- Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit… ★6,278 · Updated 3 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ★4,568 · Updated last year
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl… ★5,567 · Updated this week
- MARS5 speech model (TTS) from CAMB.AI ★2,815 · Updated last year
- ML-powered speech recognition directly in your browser ★3,230 · Updated last year
- Voice activity detector (VAD) for the browser with a simple API ★1,798 · Updated 3 weeks ago
- An Open Source text-to-speech system built by inverting Whisper. ★4,551 · Updated last month