Open Source framework for voice and multimodal conversational AI
☆10,742Mar 17, 2026Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below
Sorting:
- A framework for building realtime voice AI agents 🤖🎙️📹☆9,741Updated this week
- Open source conversation framework for structured Pipecat dialogues☆560Updated this week
- A fast multimodal LLM for real-time voice☆4,381Dec 12, 2025Updated 3 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,832Mar 4, 2026Updated 2 weeks ago
- Build, run, manage agentic software at scale.☆38,835Updated this week
- 🤖 Build voice-based LLM agents. Modular + open source.☆3,710Nov 15, 2024Updated last year
- Real-Time Voice Inference Web SDK☆304Mar 11, 2026Updated last week
- Open-source framework for conversational voice AI agents☆10,285Updated this week
- Universal memory layer for AI Agents☆50,147Updated this week
- ☆1,305Jan 29, 2026Updated last month
- Example UI implementing the RTVI web client☆474Dec 3, 2024Updated last year
- End-to-end realtime stack for connecting humans and AI☆17,711Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆39,597Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,680Updated this week
- The AI Browser Automation Framework☆21,583Updated this week
- Local realtime voice AI☆2,439Nov 26, 2025Updated 3 months ago
- DSPy: The framework for programming—not prompting—language models☆32,853Updated this week
- Build local voice agents with open-source models☆4,602Updated this week
- 🙌 OpenHands: AI-Driven Development☆69,254Updated this week
- Build AI Agents, Visually☆50,762Mar 14, 2026Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆81,169Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆46,408Updated this week
- A Conversational Speech Generation Model☆14,545May 27, 2025Updated 9 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,136Apr 19, 2025Updated 11 months ago
- Large Action Model framework to develop AI Web Agents☆6,318Jan 21, 2025Updated last year
- Automate browser based workflows with AI☆20,834Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆21,189Mar 11, 2025Updated last year
- The python library for real-time communication☆4,554Jan 12, 2026Updated 2 months ago
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆93,251Mar 15, 2026Updated last week
- SOTA Open Source TTS☆27,364Mar 13, 2026Updated last week
- Examples for Cerebrium Serverless GPUs☆522Jan 5, 2026Updated 2 months ago
- Towards Human-Sounding Speech☆6,016Dec 5, 2025Updated 3 months ago
- structured outputs for llms☆12,551Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,518Mar 8, 2026Updated 2 weeks ago
- Vane is an AI-powered answering engine.☆33,063Mar 10, 2026Updated last week
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆249Sep 10, 2025Updated 6 months ago
- LlamaIndex is the leading document agent and OCR platform☆47,753Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆54,096Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆20,821Updated this week