pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
☆5,298 · Updated this week
Alternatives and similar repositories for pipecat
Users who are interested in pipecat are comparing it to the libraries listed below:
- Build real-time multimodal AI applications 🤖🎙️📹 ☆5,343 · Updated this week
- A fast multimodal LLM for real-time voice ☆3,757 · Updated last month
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl… ☆3,648 · Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,729 · Updated 7 months ago
- The easiest way to use Agentic RAG in any enterprise ☆4,157 · Updated 2 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆3,898 · Updated 2 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆3,961 · Updated last month
- Local realtime voice AI ☆2,260 · Updated 3 weeks ago
- first base model for full-duplex conversational audio ☆1,722 · Updated 2 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ☆5,735 · Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆7,875 · Updated this week
- Desktop app for prototyping and debugging LangGraph applications locally. ☆2,630 · Updated last week
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆4,215 · Updated last week
- An Open Source text-to-speech system built by inverting Whisper. ☆4,164 · Updated 3 months ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆2,861 · Updated 4 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper ☆4,870 · Updated 5 months ago
- Inference and training library for high-quality TTS models. ☆5,148 · Updated 3 months ago
- React app for inspecting, building and debugging with the Realtime API ☆3,039 · Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,017 · Updated this week
- A language model programming library. ☆5,689 · Updated 3 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,640 · Updated 3 weeks ago
- An AI-powered search engine with a generative UI ☆7,189 · Updated last week
- Large Action Model framework to develop AI Web Agents ☆5,965 · Updated 2 months ago
- Converts text to speech in realtime ☆2,727 · Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ☆1,588 · Updated 7 months ago
- A framework for Claude Opus to intelligently orchestrate subagents. ☆4,225 · Updated 8 months ago
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ☆2,818 · Updated this week
- https://hf.co/hexgrad/Kokoro-82M ☆1,825 · Updated this week
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓 ☆3,431 · Updated this week