brilliantlabsAR / frame_realtime_gemini_voicevision
Realtime Voice and Vision with Brilliant Labs Frame and Gemini
☆67Updated 7 months ago
Alternatives and similar repositories for frame_realtime_gemini_voicevision
Users that are interested in frame_realtime_gemini_voicevision are comparing it to the libraries listed below
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆115Updated last year
- openperplex is an opensource AI search engine☆171Updated last year
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆219Updated last year
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆73Updated 2 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆79Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆212Updated last month
- ☆137Updated 10 months ago
- ☆94Updated 11 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆211Updated last month
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆226Updated last year
- AI tool that annotates research papers and shows related articles and videos for better understanding☆44Updated 6 months ago
- MCP Server to run python code locally☆55Updated last year
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆236Updated 11 months ago
- ☆191Updated last year
- napkins.dev – from screenshot to app☆86Updated last year
- ☆158Updated last week
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆77Updated last year
- OpenAI real-time voice Fastapi template with function calling with maximum simplicity. comes with arxiv paper function as an example and …☆36Updated 11 months ago
- The AI assistant for computer control.☆325Updated last year
- Chrome extension that interacts with content using Groq☆41Updated 11 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆160Updated last year
- ☆196Updated this week
- Examples for using Hyperbrowser☆156Updated last week
- mind map generator☆72Updated last year
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆222Updated last month
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆90Updated 11 months ago
- podcastfy.ai gradio demo app☆334Updated last year
- deep seek & o1 auto coders which write python code from a simple description and iteratively improve it and fix errors☆95Updated 10 months ago
- uses all reasoning models in parallel and synthesizes an answer with o1. also has multi-chat where you can chat with any of them☆39Updated 10 months ago