pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
β3,385Updated this week
Related projects β
Alternatives and complementary repositories for pipecat
- Local realtime voice AIβ1,946Updated this week
- Build real-time multimodal AI applications π€ποΈπΉβ4,010Updated this week
- A fast multimodal LLM for real-time voiceβ1,339Updated this week
- first base model for full-duplex conversational audioβ1,560Updated last week
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ3,540Updated 2 weeks ago
- Inference and training library for high-quality TTS models.β4,658Updated 3 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.β2,830Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,547Updated 3 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serperβ4,660Updated last month
- Large Action Model framework to develop AI Web Agentsβ5,477Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!β3,256Updated 3 months ago
- An AI-powered search engine with a generative UIβ6,304Updated 2 weeks ago
- The easiest way to use Agentic RAG in any enterpriseβ3,866Updated this week
- π A better UX for chat, writing content, and coding with LLMs.β2,602Updated last week
- Make websites accessible for AI agentsβ2,094Updated this week
- Turn any webpage into structured data using LLMsβ2,394Updated 2 months ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β2,571Updated last week
- The Open Source Memory Layer For Autonomous Agentsβ1,483Updated 3 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,183Updated this week
- ML-powered speech recognition directly in your browserβ2,581Updated last month
- β6,781Updated 3 weeks ago
- Vision model based document ingestionβ1,242Updated this week
- πͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llamβ¦β6,598Updated this week
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Coβ¦β1,092Updated this week
- Converts text to speech in realtimeβ2,023Updated this week
- PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LLβ¦β2,287Updated last week
- An Open Source text-to-speech system built by inverting Whisper.β3,982Updated 5 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ3,322Updated this week
- A language model programming library.β5,295Updated this week
- Convert any PDF into a podcast episode!β1,511Updated 2 weeks ago