pipecat-ai / gemini-webrtc-web-simple
Gemini Multimodal Live + WebRTC in a single `app.ts`
☆195Updated 3 months ago
Alternatives and similar repositories for gemini-webrtc-web-simple:
Users that are interested in gemini-webrtc-web-simple are comparing it to the libraries listed below
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆155Updated 6 months ago
- ☆339Updated last week
- ☆242Updated 2 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆199Updated 5 months ago
- podcastfy.ai gradio demo app☆331Updated 4 months ago
- ☆111Updated last week
- Turn local files into a prompt for an LLM☆170Updated 2 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆235Updated 7 months ago
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆113Updated 2 months ago
- Use OpenAI's realtime API for a chatting with your documents☆324Updated 6 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆109Updated 5 months ago
- ☆74Updated 2 months ago
- ☆121Updated last month
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆232Updated 5 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆253Updated 3 months ago
- napkins.dev – from screenshot to app☆85Updated 6 months ago
- Demo app for Groq plugins in LiveKit Agents☆41Updated 2 weeks ago
- mind map generator☆71Updated 3 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆272Updated 2 months ago
- Assistant for voice-to-blog writing☆132Updated 2 months ago
- Real-Time Voice Inference Web SDK☆212Updated this week
- An implementation of a computer use agent (CUA) using LangGraph☆131Updated 2 weeks ago
- openai realtime webrtc demo☆22Updated 3 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆80Updated 7 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆212Updated 5 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆219Updated 3 months ago
- Googles NotebookLM but local☆183Updated 2 weeks ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆286Updated 4 months ago
- Sample application to add voice capabilities to the Agents SDK☆81Updated this week
- Send emails directly from Cursor with this email sending MCP server☆289Updated 3 weeks ago