pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
β5,543Updated this week
Alternatives and similar repositories for pipecat:
Users that are interested in pipecat are comparing it to the libraries listed below
- A powerful framework for building realtime voice AI agents π€ποΈπΉβ5,544Updated this week
- A fast multimodal LLM for real-time voiceβ3,824Updated 2 months ago
- Local realtime voice AIβ2,277Updated last month
- A language model programming library.β5,734Updated last month
- A framework for serving and evaluating LLM routers - save LLM costs without compromising qualityβ3,802Updated 8 months ago
- πͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Openβ¦β10,340Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ3,999Updated last month
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β2,886Updated 5 months ago
- The python library for real-time communicationβ3,515Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devicesβ2,672Updated last month
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,108Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagβ¦β20,671Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.β3,724Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI appβ1,662Updated last week
- first base model for full-duplex conversational audioβ1,730Updated 3 months ago
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Coβ¦β3,542Updated last month
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)β3,922Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,031Updated 3 weeks ago
- Deploy your agentic worfklows to productionβ1,995Updated 3 weeks ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serperβ4,880Updated 6 months ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including Oβ¦β4,205Updated this week
- React app for inspecting, building and debugging with the Realtime APIβ3,132Updated last month
- Build Real-Time Knowledge Graphs for AI Agentsβ3,961Updated this week
- ML-powered speech recognition directly in your browserβ2,883Updated 6 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,596Updated 8 months ago
- The Open Source Memory Layer For Autonomous Agentsβ2,172Updated 5 months ago
- Agent Framework / shim to use Pydantic with LLMsβ8,377Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β8,079Updated this week
- SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ1,213Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.β6,134Updated 3 weeks ago