saharmor / gemini-multimodal-playgroundLinks
Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)
☆284Updated 3 months ago
Alternatives and similar repositories for gemini-multimodal-playground
Users that are interested in gemini-multimodal-playground are comparing it to the libraries listed below
Sorting:
- Assistant for voice-to-blog writing☆136Updated 4 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆158Updated 8 months ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆287Updated 6 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆224Updated 5 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆272Updated 5 months ago
- Use OpenAI's realtime API for a chatting with your documents☆331Updated 7 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆239Updated 9 months ago
- ☆405Updated 2 months ago
- ReActMCP is a reactive MCP client that empowers AI assistants to instantly respond with real-time, Markdown-formatted web search insights…☆137Updated 2 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆205Updated 5 months ago
- 🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.☆359Updated this week
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆111Updated 7 months ago
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux☆254Updated 3 weeks ago
- Oliva Multi-Agent Assistant☆364Updated last month
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆217Updated 7 months ago
- 🧍♂️LLM as a manager for approval processes.☆199Updated last month
- The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep …☆393Updated last month
- ☆137Updated 6 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆201Updated 7 months ago
- podcastfy.ai gradio demo app☆333Updated 6 months ago
- Turn local files into a prompt for an LLM☆171Updated 4 months ago
- ☆206Updated 3 months ago
- YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel videos using AI agents. Extract insights fro…☆451Updated 2 months ago
- ☆234Updated 6 months ago
- Multi-agent that helps you organize and write documents.☆328Updated 6 months ago
- Leverage the OpenAI Realtime API (12-17-2024) with this Next.js 15 starter template featuring shadcn/ui components, tool-calling & locali…☆377Updated last month
- ☆246Updated 4 months ago
- openperplex is an opensource AI search engine☆166Updated 10 months ago
- An implementation of a computer use agent (CUA) using LangGraph☆150Updated 2 months ago
- ☆256Updated 7 months ago