pipecat-ai / gemini-webrtc-web-simpleLinks
Gemini Multimodal Live + WebRTC in a single `app.ts`
☆212Updated 3 months ago
Alternatives and similar repositories for gemini-webrtc-web-simple
Users that are interested in gemini-webrtc-web-simple are comparing it to the libraries listed below
Sorting:
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆162Updated 3 weeks ago
- Use OpenAI's realtime API for a chatting with your documents☆329Updated last year
- ☆253Updated last year
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆220Updated last year
- podcastfy.ai gradio demo app☆333Updated last year
- Turn local files into a prompt for an LLM☆177Updated last year
- SearchGPT / Perplexity Pages clone, but personalised for you.☆248Updated last year
- Real-Time Voice Inference Web SDK☆300Updated this week
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆115Updated last year
- Demo app for Groq plugins in LiveKit Agents☆59Updated 10 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆239Updated last month
- ⚡ Insanely fast AI voice assistant with <500ms response times☆583Updated last year
- 🔥 Generate llms.txt and llms-full.txt files for any website!☆512Updated 7 months ago
- Get started with native image generation and editing using Gemini 2.0 and Next.js☆521Updated this week
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆322Updated 4 months ago
- Assistant for voice-to-blog writing☆147Updated last year
- mind map generator☆71Updated last year
- The Open Deep Research app – generate reports with OSS LLMs☆316Updated this week
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆232Updated last year
- openperplex is an opensource AI search engine☆170Updated last year
- The AI assistant for computer control.☆329Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆248Updated 4 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆298Updated last year
- Generate descriptions from product images in multiple languages with AI☆323Updated last year
- Filter X content using LLM API requests, configurable, based on Groq API☆132Updated last year
- Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini☆68Updated 8 months ago
- A real-time Agent framework for audio and video.☆168Updated 3 weeks ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆226Updated 3 months ago
- Chat with any website on your local machine☆85Updated last year
- ☆201Updated this week