Examples of using Gemini to extract data from media files
☆118Mar 13, 2025Updated last year
Alternatives and similar repositories for gemini-multimodal-structured-extraction
Users that are interested in gemini-multimodal-structured-extraction are comparing it to the libraries listed below.
- ai trading agent using interactive brokers api☆97Feb 17, 2025Updated last year
- prediction market assistant using kalshi API and perplexity sonar api☆50Feb 22, 2025Updated last year
- GPT-4o Powered Calorie Detector☆18May 29, 2024Updated last year
- A React-based web application that allows users to share their screen and audio with an AI assistant. The assistant provides real-time tr…☆22Sep 22, 2025Updated 6 months ago
- Task management for AI agents☆15Jun 25, 2025Updated 9 months ago
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆25Feb 19, 2026Updated last month
- An AI Hedge Fund Team☆23Jan 16, 2025Updated last year
- Insanely Fast Transcription: A Python-based utility for rapid audio transcription from YouTube videos or local files. Leverages GPU accel…☆95Jul 20, 2024Updated last year
- What if we could pack single purpose, powerful AI Agents into a single python file?☆433Apr 8, 2025Updated last year
- Fast STT, LLM, and TTS for personal AI assistants using OpenAI, Groq, AssemblyAI and ElevenLabs.☆195Oct 2, 2024Updated last year
- Turn the files in your Python project into a single *.md for submission to LLMs☆64Sep 13, 2024Updated last year
- A Python-based text editor server built with FastMCP that provides tools for file operations. This server enables reading, editing, and m…☆14Aug 21, 2025Updated 7 months ago
- ☆12Apr 22, 2025Updated 11 months ago
- Diverse collection of 100 Hydrogen Torch Use-Cases by different industries, data-types, and problem types☆11Oct 10, 2024Updated last year
- Look, Anthropic's Claude-3.7-Sonnet is a powerful, hybrid CRASHOUT LLM. Let's understand this monumental release.☆87Mar 2, 2025Updated last year
- App built in the "Coding the Future With AI" YouTube tutorial series "Mastering AI Coding"☆12Jan 5, 2025Updated last year
- A sandbox for showcasing different use cases of LangChain's createAgent☆68Dec 11, 2025Updated 4 months ago
- Notebooks for exploring prediction markets (eg. Kalshi, Polymarket, ForecastTrader)☆26Aug 29, 2024Updated last year
- An opinionated, Agentic Engineering toolbox powered by LLM Agents to solve problems autonomously.☆170Mar 24, 2024Updated 2 years ago
- ☆21Apr 9, 2025Updated last year
- A simple voice agent using FastRTC and Groq☆60May 16, 2025Updated 11 months ago
- Benchmarks you can feel☆451May 24, 2025Updated 10 months ago
- ☆140Dec 1, 2024Updated last year
- Python package to extract and analyse Canadian, United States and Indian real estate data from REALTOR.CA, REALTOR.COM and HOUSING.COM☆16Dec 21, 2025Updated 3 months ago
- A Workers implementation of the OpenAI realtime relay service☆23Oct 2, 2024Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37May 18, 2025Updated 10 months ago
- Stock Trading Model using Q Learning☆10Dec 16, 2020Updated 5 years ago
- Guides for Model Context Protocol (MCP) and Agent Communication Protocol (ACP)☆21Jan 26, 2026Updated 2 months ago
- Fontawesome for sciter.js☆10Feb 28, 2025Updated last year
- Instantly convert ideas into app code with AI! This React app uses the Gemini API to generate and preview code from Markdown, making prot…☆13Mar 31, 2026Updated 2 weeks ago
- VisionCraft MCP delivers up-to-date, specialized computer vision and Gen-AI knowledge directly to Claude and other AI assistants.☆118Sep 19, 2025Updated 6 months ago
- ☆19Jun 28, 2025Updated 9 months ago
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆93Jan 12, 2025Updated last year
- Inbox Zero with AI☆33May 9, 2025Updated 11 months ago
- ☆14Nov 16, 2024Updated last year
- The Columns client SDK to create, publish and share data visualization☆18May 29, 2025Updated 10 months ago
- Container Runtime from scratch☆22Jun 21, 2025Updated 9 months ago
- A comprehensive collection of AI prompts with structured categories, subcategories, and searchable keywords. Each prompt includes detaile…☆76Jan 12, 2025Updated last year
- ☆14Nov 3, 2024Updated last year