pipecat-ai / pipecatLinks
Open Source framework for voice and multimodal conversational AI
β8,246Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below
Sorting:
- A powerful framework for building realtime voice AI agents π€ποΈπΉβ7,711Updated this week
- A fast multimodal LLM for real-time voiceβ4,211Updated last month
- Local realtime voice AIβ2,368Updated 7 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ4,190Updated 5 months ago
- React app for inspecting, building and debugging with the Realtime APIβ3,461Updated last month
- The python library for real-time communicationβ4,326Updated 2 weeks ago
- Converts text to speech in realtimeβ3,550Updated 2 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β8,960Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β3,073Updated 4 months ago
- π§ Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 πβ4,558Updated this week
- β953Updated 3 weeks ago
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitchingβ3,732Updated 2 months ago
- https://hf.co/hexgrad/Kokoro-82Mβ4,492Updated 2 months ago
- Flexible and powerful framework for managing multiple AI agents and handling complex conversationsβ6,944Updated last week
- Towards Human-Sounding Speechβ5,597Updated 5 months ago
- π€ Build voice-based LLM agents. Modular + open source.β3,459Updated 10 months ago
- first base model for full-duplex conversational audioβ1,764Updated 9 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,897Updated last month
- Have a natural, spoken conversation with AI!β3,219Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,357Updated last month
- A language model programming library.β5,846Updated 4 months ago
- Inference and training library for high-quality TTS models.β5,426Updated 9 months ago
- ML-powered speech recognition directly in your browserβ3,106Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,981Updated last year
- A framework for serving and evaluating LLM routers - save LLM costs without compromising qualityβ4,309Updated last year
- Voice activity detector (VAD) for the browser with a simple APIβ1,621Updated last week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ6,976Updated last month
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,631Updated last year
- TTS with kokoro and onnx runtimeβ2,209Updated 3 months ago
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcriβ¦β8,689Updated 2 months ago