brilliantlabsAR / frame_realtime_gemini_voicevision
Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini
β52Updated this week
Alternatives and similar repositories for frame_realtime_gemini_voicevision
Users that are interested in frame_realtime_gemini_voicevision are comparing it to the libraries listed below
Sorting:
- Building Blocks for Multi-Modal Gradio Powered by Groq Appsβ109Updated 6 months ago
- π₯β‘οΈπ Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Chβ¦β81Updated 8 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fastβ71Updated 6 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilioβ68Updated 2 weeks ago
- MCP Server to run python code locallyβ53Updated 5 months ago
- Your own Coding Agent π€β76Updated 4 months ago
- β130Updated 2 weeks ago
- Jockey is a conversational video agent.β76Updated 3 months ago
- openperplex is an opensource AI search engineβ165Updated 9 months ago
- A Multi-modal MCP client for voice powered agentic workflowsβ171Updated 3 months ago
- ReActMCP is a reactive MCP server that empowers AI assistants to instantly respond with real-time, Markdown-formatted web search insightsβ¦β135Updated last month
- β28Updated 5 months ago
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.β89Updated 4 months ago
- An amazon fresh mcp serverβ63Updated 5 months ago
- β94Updated 4 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.β237Updated 8 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Consoleβ156Updated 7 months ago
- Opensource chat app that uses Exa's API for web search and OpenAI o3-miniβ45Updated 2 months ago
- Chrome extension that interacts with content using Groqβ41Updated 4 months ago
- mind map generatorβ72Updated 5 months ago
- Chat with any website on your local machineβ74Updated 10 months ago
- β95Updated last week
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.β215Updated 6 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`β199Updated 4 months ago
- OpenAI real-time voice Fastapi template with function calling with maximum simplicity. comes with arxiv paper function as an example and β¦β37Updated 4 months ago
- Turn any developer documentation into a GPTβ92Updated 2 months ago
- deep seek & o1 auto coders which write python code from a simple description and iteratively improvesit and fix errorsβ98Updated 3 months ago
- β161Updated 11 months ago
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usinβ¦β210Updated last year
- AI tool that annotates research papers and shows related articles and videos for better understandingβ42Updated this week