pipecat-ai / gemini-multimodal-live-demoLinks
Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat
☆222Updated 2 months ago
Alternatives and similar repositories for gemini-multimodal-live-demo
Users that are interested in gemini-multimodal-live-demo are comparing it to the libraries listed below
Sorting:
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆249Updated 3 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆78Updated last year
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆73Updated 3 months ago
- ☆136Updated 10 months ago
- Real-Time Voice Inference Web SDK☆293Updated this week
- ☆159Updated this week
- Prompt to ui for fun☆237Updated last year
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆91Updated 11 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆227Updated last year
- An amazon fresh mcp server☆63Updated last year
- OpenAI real-time voice Fastapi template with function calling with maximum simplicity. comes with arxiv paper function as an example and …☆36Updated 11 months ago
- Example code and guides for building with Scrapybara☆138Updated 9 months ago
- ☆75Updated 6 months ago
- Use OpenAI's realtime API for a chatting with your documents☆330Updated last year
- uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API☆97Updated 11 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- Demonstrates how to protect your OpenAI API Key using a Cloudflare Worker to serve your ephemeral token and then do client side tool call…☆323Updated 10 months ago
- Example projects built with the Hume AI APIs☆233Updated this week
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆160Updated last year
- Sample application to add voice capabilities to the Agents SDK☆245Updated 7 months ago
- Model Context Protocol Servers (Browserbase Version)☆49Updated last year
- ☆94Updated 10 months ago
- The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integra…☆177Updated 11 months ago
- Turn any developer documentation into a GPT☆101Updated 9 months ago
- ☆106Updated last year
- ☆191Updated last year
- ☆171Updated last year
- Play with OpenAI's new Realtime API in your browser☆22Updated last year
- The agentic video editing framework☆186Updated 10 months ago
- This is a fork of Open Deep Research by @dzhng, enhanced with REST API implementation for integration into the CodeGuide platform.☆44Updated 5 months ago