pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

☆7,417

Alternatives and similar repositories for pipecat

Users who are interested in pipecat are comparing it to the libraries listed below.

fixie-ai / ultravox
A fast multimodal LLM for real-time voice
☆4,132 · Updated this week
livekit / agents
A powerful framework for building realtime voice AI agents 🤖🎙️📹
☆7,003 · Updated this week
gradio-app / fastrtc
The python library for real-time communication
☆4,174 · Updated last week
menloresearch / ichigo
Local realtime voice AI
☆2,343 · Updated 5 months ago
openai / openai-realtime-console
React app for inspecting, building and debugging with the Realtime API
☆3,376 · Updated last month
KoljaB / RealtimeTTS
Converts text to speech in realtime
☆3,357 · Updated 2 weeks ago
kyutai-labs / moshi
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆8,720 · Updated this week
canopyai / Orpheus-TTS
Towards Human-Sounding Speech
☆5,327 · Updated 3 months ago
huggingface / speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆4,124 · Updated 3 months ago
moonshine-ai / moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
☆2,805 · Updated 2 months ago
awslabs / agent-squad
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
☆6,349 · Updated last month
ictnlp / LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…
☆2,970 · Updated 2 months ago
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,747 · Updated 7 months ago
MadcowD / ell
A language model programming library.
☆5,802 · Updated 2 months ago
nari-labs / dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆17,775 · Updated last month
remsky / Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
☆3,382 · Updated last week
KoljaB / RealtimeVoiceChat
Have a natural, spoken conversation with AI!
☆2,875 · Updated 3 weeks ago
ricky0123 / vad
Voice activity detector (VAD) for the browser with a simple API
☆1,505 · Updated 2 weeks ago
huggingface / parler-tts
Inference and training library for high-quality TTS models.
☆5,374 · Updated 7 months ago
collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆3,222 · Updated 2 weeks ago
steel-dev / steel-browser
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…
☆4,894 · Updated this week
SesameAILabs / csm
A Conversational Speech Generation Model
☆13,858 · Updated 2 months ago
pipecat-ai / smart-turn
☆847 · Updated 2 weeks ago
hexgrad / kokoro
https://hf.co/hexgrad/Kokoro-82M
☆3,873 · Updated this week
KoljaB / RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…
☆8,278 · Updated 3 weeks ago
Helicone / helicone
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
☆4,267 · Updated this week
ufal / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,188 · Updated last month
vocodedev / vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
☆3,396 · Updated 8 months ago
speaches-ai / speaches
☆2,179 · Updated this week
langchain-ai / open-canvas
📃 A better UX for chat, writing content, and coding with LLMs.
☆4,867 · Updated 2 months ago