pipecat-ai / pipecatLinks
Open Source framework for voice and multimodal conversational AI
β7,417Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below
Sorting:
- A fast multimodal LLM for real-time voiceβ4,132Updated this week
- A powerful framework for building realtime voice AI agents π€ποΈπΉβ7,003Updated this week
- The python library for real-time communicationβ4,174Updated last week
- Local realtime voice AIβ2,343Updated 5 months ago
- React app for inspecting, building and debugging with the Realtime APIβ3,376Updated last month
- Converts text to speech in realtimeβ3,357Updated 2 weeks ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β8,720Updated this week
- Towards Human-Sounding Speechβ5,327Updated 3 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ4,124Updated 3 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,805Updated 2 months ago
- Flexible and powerful framework for managing multiple AI agents and handling complex conversationsβ6,349Updated last month
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β2,970Updated 2 months ago
- first base model for full-duplex conversational audioβ1,747Updated 7 months ago
- A language model programming library.β5,802Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.β17,775Updated last month
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitchingβ3,382Updated last week
- Have a natural, spoken conversation with AI!β2,875Updated 3 weeks ago
- Voice activity detector (VAD) for the browser with a simple APIβ1,505Updated 2 weeks ago
- Inference and training library for high-quality TTS models.β5,374Updated 7 months ago
- A nearly-live implementation of OpenAI's Whisper.β3,222Updated 2 weeks ago
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wiβ¦β4,894Updated this week
- A Conversational Speech Generation Modelβ13,858Updated 2 months ago
- β847Updated 2 weeks ago
- https://hf.co/hexgrad/Kokoro-82Mβ3,873Updated this week
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcriβ¦β8,278Updated 3 weeks ago
- π§ Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 πβ4,267Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,188Updated last month
- π€ Build voice-based LLM agents. Modular + open source.β3,396Updated 8 months ago
- β2,179Updated this week
- π A better UX for chat, writing content, and coding with LLMs.β4,867Updated 2 months ago