examples for using gemini to extract data from media files
☆116Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for gemini-multimodal-structured-extraction
Users that are interested in gemini-multimodal-structured-extraction are comparing it to the libraries listed below
Sorting:
- A docker container and flask app for use with the Interactive Brokers Web API 1.0☆247Feb 17, 2025Updated last year
- prediction market assistant using kalshi API and perplexity sonar api☆50Feb 22, 2025Updated last year
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆25Feb 19, 2026Updated 2 weeks ago
- Learn how to build a Podcast Discovery platform with Python, Reflex, and Clerk.☆16Jun 19, 2025Updated 8 months ago
- Bypass browser bot detection in langchain tools☆17Feb 10, 2026Updated 3 weeks ago
- Have an LLM write your biography, probably incorrectly☆13Dec 26, 2024Updated last year
- GPT-4o Powered Calorie Detecor☆18May 29, 2024Updated last year
- Task management for AI agents☆15Jun 25, 2025Updated 8 months ago
- ☆12Mar 8, 2024Updated last year
- Fast STT, LLM, and TTS for personal AI assistants using OpenAI, Groq, AssemblyAI and ElevenLabs.☆193Oct 2, 2024Updated last year
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆18Jun 5, 2025Updated 9 months ago
- API for custom GPT Actions to talk to custom GPT Agents.☆15Apr 28, 2024Updated last year
- This is a Streamlit-based UI for a GPT-3.5-powered venture capitalist bot. The bot is designed to help entrepreneurs engage in conversati…☆20Mar 21, 2023Updated 2 years ago
- A React-based web application that allows users to share their screen and audio with an AI assistant. The assistant provides real-time tr…☆22Sep 22, 2025Updated 5 months ago
- ☆21Jan 27, 2023Updated 3 years ago
- An AI Hedge Fund Team☆22Jan 16, 2025Updated last year
- Insanely Fast Transcription: A Python-based utility for rapid audio transcription from YouTube videos or local files. Leverages GPU accel…☆94Jul 20, 2024Updated last year
- ☆141Oct 27, 2024Updated last year
- Grompt is a Python utility that uses the Groq LLM provider service to instantly refactor amazingly detailed and effective prompts. It's d…☆48Dec 9, 2024Updated last year
- Notebooks for exploring prediction markets (eg. Kalshi, Polymarket, ForecastTrader)☆25Aug 29, 2024Updated last year
- ☆58Aug 31, 2025Updated 6 months ago
- Running Deepseek R1 Local on Ollama☆26Jan 28, 2025Updated last year
- graphRAG approach with .pcap☆22Jul 17, 2024Updated last year
- Benchmarks you can feel☆449May 24, 2025Updated 9 months ago
- An opinionated, Agentic Engineering toolbox powered by LLM Agents to solve problems autonomously.☆168Mar 24, 2024Updated last year
- What if we could pack single purpose, powerful AI Agents into a single python file?☆428Apr 8, 2025Updated 10 months ago
- ☆100Nov 13, 2023Updated 2 years ago
- api integration with oobabooga text generation webui for rivet nodes☆25Feb 14, 2024Updated 2 years ago
- simple terminal-based AI coding agent. This is for learning purposes more than a final working app.☆27Mar 6, 2025Updated last year
- ☆27May 25, 2024Updated last year
- This is system where images are trained and recognize of bumch of faces at a time☆23Oct 25, 2025Updated 4 months ago
- Inbox Zero with AI☆32May 9, 2025Updated 9 months ago
- VisionCraft MCP delivers up-to-date, specialized computer vision and Gen-AI knowledge directly to Claude and other AI assistants.☆117Sep 19, 2025Updated 5 months ago
- Puppeteer automation through n8n☆22Oct 25, 2022Updated 3 years ago
- Receive AI-driven critiques, enhancements, and freshly generated designs for your UI/UX designs.☆30Feb 9, 2025Updated last year
- My personal response to OpenAI's Grant Challenge☆29Jun 13, 2023Updated 2 years ago
- A comprehensive collection of AI prompts with structured categories, subcategories, and searchable keywords. Each prompt includes detaile…☆76Jan 12, 2025Updated last year
- ☆11May 14, 2025Updated 9 months ago
- A sandbox for showcasing different use cases of LangChain's createAgent☆67Dec 11, 2025Updated 2 months ago