pipecat-ai / pipecatLinks
Open Source framework for voice and multimodal conversational AI
β9,716Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below
Sorting:
- A powerful framework for building realtime voice AI agents π€ποΈπΉβ8,986Updated this week
- A fast multimodal LLM for real-time voiceβ4,309Updated 3 weeks ago
- The python library for real-time communicationβ4,480Updated last month
- Have a natural, spoken conversation with AI!β3,448Updated 6 months ago
- Converts text to speech in realtimeβ3,704Updated 5 months ago
- β1,189Updated last month
- π€ Build voice-based LLM agents. Modular + open source.β3,673Updated last year
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β3,114Updated 7 months ago
- Local realtime voice AIβ2,426Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ4,266Updated 8 months ago
- Inference and training library for high-quality TTS models.β5,505Updated last year
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ3,065Updated last month
- An Open Source text-to-speech system built by inverting Whisper.β4,550Updated 3 weeks ago
- Towards Human-Sounding Speechβ5,860Updated last month
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β9,250Updated last month
- β2,780Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,509Updated last month
- Automate browser based workflows with AIβ20,054Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising qualityβ4,502Updated last year
- π§ Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 πβ4,891Updated last week
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simplβ¦β5,545Updated this week
- first base model for full-duplex conversational audioβ1,769Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,110Updated last year
- Large Action Model framework to develop AI Web Agentsβ6,250Updated 11 months ago
- Foundational model for human-like, expressive TTSβ4,196Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,643Updated last year
- A nearly-live implementation of OpenAI's Whisper.β3,721Updated 3 months ago
- πͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Openβ¦β20,155Updated this week
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web witβ¦β6,184Updated 3 weeks ago
- MARS5 speech model (TTS) from CAMB.AIβ2,809Updated last year