brilliantlabsAR / frame_realtime_gemini_voicevisionLinks
Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini
☆58Updated 2 months ago
Alternatives and similar repositories for frame_realtime_gemini_voicevision
Users that are interested in frame_realtime_gemini_voicevision are comparing it to the libraries listed below
Sorting:
- ☆95Updated 7 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆112Updated 8 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆208Updated 6 months ago
- openperplex is an opensource AI search engine☆168Updated 11 months ago
- ☆183Updated 7 months ago
- ☆134Updated 5 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆209Updated 8 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆228Updated 6 months ago
- ☆170Updated 11 months ago
- Turn local files into a prompt for an LLM☆173Updated 5 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated 3 weeks ago
- ☆315Updated 6 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆159Updated 9 months ago
- podcastfy.ai gradio demo app☆335Updated 7 months ago
- Use OpenAI's realtime API for a chatting with your documents☆330Updated 9 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆74Updated 8 months ago
- An examples code to make langchain agents without openai API key (Google Gemini), Completely free unlimited and open source, run it yours…☆314Updated 11 months ago
- A user interface for DSPy☆162Updated last month
- The agentic video editing framework☆143Updated 5 months ago
- The AI assistant for computer control.☆313Updated 9 months ago
- The Open Deep Research app – generate reports with OSS LLMs☆261Updated 3 weeks ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆220Updated 8 months ago
- Turn topics into essays in seconds!☆185Updated last week
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆280Updated 6 months ago
- ☆28Updated 7 months ago
- mind map generator☆72Updated 7 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆203Updated 3 months ago
- ☆164Updated last year
- ☆141Updated 2 months ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆287Updated 7 months ago