pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
β5,200Updated this week
Alternatives and similar repositories for pipecat:
Users that are interested in pipecat are comparing it to the libraries listed below
- A fast multimodal LLM for real-time voiceβ3,738Updated last month
- Build real-time multimodal AI applications π€ποΈπΉβ5,299Updated this week
- π A better UX for chat, writing content, and coding with LLMs.β4,132Updated last week
- The most advanced AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.β5,507Updated this week
- Local realtime voice AIβ2,256Updated 2 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising qualityβ3,729Updated 7 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,633Updated 3 weeks ago
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Coβ¦β3,400Updated 3 weeks ago
- A language model programming library.β5,689Updated 3 weeks ago
- π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking libraryβ2,818Updated this week
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wiβ¦β4,003Updated last week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β2,857Updated 4 months ago
- Large Action Model framework to develop AI Web Agentsβ5,965Updated 2 months ago
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simplβ¦β3,648Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serperβ4,869Updated 5 months ago
- Inference and training library for high-quality TTS models.β5,148Updated 3 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ3,898Updated 2 weeks ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β7,875Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ3,934Updated last month
- Build and query dynamic, temporally-aware Knowledge Graphsβ2,478Updated this week
- first base model for full-duplex conversational audioβ1,719Updated 2 months ago
- Desktop app for prototyping and debugging LangGraph applications locally.β2,630Updated last week
- Agent Framework / shim to use Pydantic with LLMsβ7,308Updated this week
- Converts text to speech in realtimeβ2,688Updated last week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β8,312Updated this week
- File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.β5,884Updated last month
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other entβ¦β2,604Updated this week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,β¦β5,669Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.β19,354Updated last week
- The Open Source Memory Layer For Autonomous Agentsβ2,041Updated 5 months ago