Open Source framework for voice and multimodal conversational AI
☆11,687May 2, 2026Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for building realtime voice AI agents 🤖🎙️📹☆10,353Updated this week
- Open source conversation framework for structured Pipecat dialogues☆583Updated this week
- A fast multimodal LLM for real-time voice☆4,412Dec 12, 2025Updated 4 months ago
- Run agents as production software.☆39,835May 1, 2026Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,111Apr 28, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 🤖 Build voice-based LLM agents. Modular + open source.☆3,732Nov 15, 2024Updated last year
- Real-Time Voice Inference Web SDK☆311Updated this week
- Open-source framework for conversational voice AI agents☆10,462Apr 30, 2026Updated last week
- Universal memory layer for AI Agents☆54,714Apr 30, 2026Updated last week
- ☆1,368Jan 29, 2026Updated 3 months ago
- End-to-end realtime stack for connecting humans and AI☆18,458Apr 30, 2026Updated last week
- Example UI implementing the RTVI web client☆474Dec 3, 2024Updated last year
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆45,804Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆22,391Apr 12, 2026Updated 3 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The SDK For Browser Agents☆22,463Updated this week
- Local realtime voice AI☆2,484Nov 26, 2025Updated 5 months ago
- DSPy: The framework for programming—not prompting—language models☆34,180May 2, 2026Updated last week
- Build local voice agents with open-source models☆4,716Updated this week
- 🙌 OpenHands: AI-Driven Development☆72,542Updated this week
- Build AI Agents, Visually☆52,479May 2, 2026Updated last week
- A Conversational Speech Generation Model☆14,616May 27, 2025Updated 11 months ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆50,629Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆92,144Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Large Action Model framework to develop AI Web Agents☆6,333Jan 21, 2025Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,413Apr 19, 2025Updated last year
- Automate browser based workflows with AI☆21,491Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆21,427Apr 15, 2026Updated 3 weeks ago
- The python library for real-time communication☆4,583Jan 12, 2026Updated 3 months ago
- SOTA Open Source TTS☆30,034Apr 6, 2026Updated last month
- 🔥 The API to search, scrape, and interact with the web for AI☆113,973May 2, 2026Updated last week
- Examples for Cerebrium Serverless GPUs☆521May 1, 2026Updated last week
- structured outputs for llms☆12,889Apr 22, 2026Updated 2 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Towards Human-Sounding Speech☆6,127Dec 5, 2025Updated 5 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,993Mar 26, 2026Updated last month
- Vane is an AI-powered answering engine.☆34,125Apr 11, 2026Updated 3 weeks ago
- LlamaIndex is the leading document agent and OCR platform☆49,127Updated this week
- Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆63,536Updated this week
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆247Sep 10, 2025Updated 7 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,760Apr 4, 2026Updated last month