pipecat-ai / gemini-multimodal-live-demoLinks
Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat
☆222Updated 2 months ago
Alternatives and similar repositories for gemini-multimodal-live-demo
Users that are interested in gemini-multimodal-live-demo are comparing it to the libraries listed below
Sorting:
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆249Updated 3 months ago
- ☆136Updated 10 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆78Updated last year
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆73Updated 3 months ago
- Example code and guides for building with Scrapybara☆138Updated 9 months ago
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆91Updated 11 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆227Updated last year
- ☆191Updated last year
- Example projects built with the Hume AI APIs☆233Updated this week
- Real-Time Voice Inference Web SDK☆293Updated this week
- uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API☆97Updated 11 months ago
- OpenAI real-time voice Fastapi template with function calling with maximum simplicity. comes with arxiv paper function as an example and …☆36Updated 11 months ago
- ☆75Updated 6 months ago
- The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integra…☆177Updated 10 months ago
- Prompt to ui for fun☆237Updated last year
- ☆158Updated 2 weeks ago
- An amazon fresh mcp server☆63Updated last year
- ☆171Updated last year
- ☆167Updated 2 months ago
- Use OpenAI's realtime API for a chatting with your documents☆330Updated last year
- ☆52Updated 10 months ago
- ☆106Updated last year
- A fork of OpenAI Swarm that supports Groq and Anthropic☆124Updated 10 months ago
- This is a fork of Open Deep Research by @dzhng, enhanced with REST API implementation for integration into the CodeGuide platform.☆43Updated 5 months ago
- a minimalistic template for dynamic self-building AI agents☆96Updated 11 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆59Updated last year
- Sample application to add voice capabilities to the Agents SDK☆245Updated 7 months ago
- The agentic video editing framework☆185Updated 10 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆160Updated last year
- Demonstrates how to protect your OpenAI API Key using a Cloudflare Worker to serve your ephemeral token and then do client side tool call…☆323Updated 10 months ago