pipecat-ai / gemini-multimodal-live-demoLinks
Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat
☆224Updated 2 months ago
Alternatives and similar repositories for gemini-multimodal-live-demo
Users that are interested in gemini-multimodal-live-demo are comparing it to the libraries listed below
Sorting:
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆249Updated 4 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆78Updated last year
- ☆135Updated 11 months ago
- Real-Time Voice Inference Web SDK☆298Updated 3 weeks ago
- An amazon fresh mcp server☆62Updated last year
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆91Updated last year
- The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integra…☆177Updated 11 months ago
- Example projects built with the Hume AI APIs☆234Updated this week
- ☆191Updated last year
- Example code and guides for building with Scrapybara☆138Updated 9 months ago
- Prompt to ui for fun☆237Updated last year
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated 3 months ago
- OpenAI real-time voice Fastapi template with function calling with maximum simplicity. comes with arxiv paper function as an example and …☆36Updated last year
- ☆75Updated 6 months ago
- Sample application to add voice capabilities to the Agents SDK☆247Updated 8 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆227Updated last year
- A Multi-modal MCP client for voice powered agentic workflows☆208Updated 11 months ago
- The agentic video editing framework☆207Updated 11 months ago
- ☆166Updated 2 months ago
- ☆158Updated 3 weeks ago
- uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API☆97Updated last year
- This is a fork of Open Deep Research by @dzhng, enhanced with REST API implementation for integration into the CodeGuide platform.☆44Updated 5 months ago
- A Newsletter Agent that Aggregates Articles and Generates a Newsletter - Langflow, NextJS☆61Updated last year
- Groq-Powered Real-Time Voice Assistant☆226Updated last year
- Turn any developer documentation into a GPT☆101Updated 10 months ago
- ☆106Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- Use OpenAI's realtime API for a chatting with your documents☆331Updated last year
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆220Updated last year
- ☆172Updated last year