Open Source framework for voice and multimodal conversational AI
☆12,260May 15, 2026Updated this week
Alternatives and similar repositories for pipecat
Users that are interested in pipecat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for building realtime voice AI agents 🤖🎙️📹☆10,531Updated this week
- Open source conversation framework for structured Pipecat dialogues☆586May 11, 2026Updated last week
- A fast multimodal LLM for real-time voice☆4,424Dec 12, 2025Updated 5 months ago
- Build, run, and manage agent platforms.☆40,135May 15, 2026Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,211May 5, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🤖 Build voice-based LLM agents. Modular + open source.☆3,745Nov 15, 2024Updated last year
- Open-source framework for conversational voice AI agents☆10,576May 14, 2026Updated last week
- Real-Time Voice Inference Web SDK☆313Updated this week
- Universal memory layer for AI Agents☆56,013Updated this week
- ☆1,380Jan 29, 2026Updated 3 months ago
- End-to-end realtime stack for connecting humans and AI☆18,695May 15, 2026Updated last week
- Example UI implementing the RTVI web client☆473Dec 3, 2024Updated last year
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆47,667Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆22,728May 14, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The SDK For Browser Agents☆22,722Updated this week
- Local realtime voice AI☆2,484Nov 26, 2025Updated 5 months ago
- DSPy: The framework for programming—not prompting—language models☆34,496Updated this week
- Build local voice agents with open-source models☆4,755Updated this week
- Build AI Agents, Visually☆52,839May 14, 2026Updated last week
- 🙌 OpenHands: AI-Driven Development☆73,913Updated this week
- A Conversational Speech Generation Model☆14,627May 27, 2025Updated 11 months ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆51,703Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆94,598Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Large Action Model framework to develop AI Web Agents☆6,351Jan 21, 2025Updated last year
- Automate browser based workflows with AI☆21,645Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,509Apr 19, 2025Updated last year
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆21,513Apr 15, 2026Updated last month
- The python library for real-time communication☆4,584Jan 12, 2026Updated 4 months ago
- SOTA Open Source TTS☆30,356May 12, 2026Updated last week
- 🔥 Search, scrape, and clean the web for AI agents.☆120,407Updated this week
- Examples for Cerebrium Serverless GPUs☆522May 8, 2026Updated last week
- Towards Human-Sounding Speech☆6,148Dec 5, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- structured outputs for llms☆12,974Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆9,038Mar 26, 2026Updated last month
- Vane is an AI-powered answering engine.☆34,453Apr 11, 2026Updated last month
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆247Sep 10, 2025Updated 8 months ago
- LlamaIndex is the leading document agent and OCR platform☆49,501Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆64,485Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,896Apr 4, 2026Updated last month