hackingthemarkets / gemini-multimodal-structured-extractionLinks
examples for using gemini to extract data from media files
☆116Updated 9 months ago
Alternatives and similar repositories for gemini-multimodal-structured-extraction
Users that are interested in gemini-multimodal-structured-extraction are comparing it to the libraries listed below
Sorting:
- ☆141Updated last year
- Communicate your work with diagrams in seconds with GenAI + Mermaid☆166Updated last year
- GroqCrawl is a powerful and user-friendly web crawling and scraping application built with Streamlit and powered by PocketGroq. It provid…☆99Updated last year
- Fast STT, LLM, and TTS for personal AI assistants using OpenAI, Groq, AssemblyAI and ElevenLabs.☆183Updated last year
- ai trading agent using interactive brokers api☆91Updated 10 months ago
- Using Mem0 with Agency Swarm☆40Updated 10 months ago
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆91Updated 11 months ago
- ☆140Updated last year
- Build a crypto bot for trading with LLMs and NLP☆165Updated 8 months ago
- uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API☆97Updated 11 months ago
- An opinionated, Agentic Engineering toolbox powered by LLM Agents to solve problems autonomously.☆157Updated last year
- Extract, timestamp, and analyze specific content from video collections using LLM-powered audio/video processing.☆60Updated 3 months ago
- PocketGroq is a powerful Python library that simplifies integration with the Groq API, offering advanced features for natural language pr…☆209Updated 10 months ago
- An AI agent that automates company research and lead prospecting (powered by langgraph and firecrawl)☆174Updated 11 months ago
- ☆96Updated last year
- ☆52Updated 9 months ago
- Groqqle is a powerful web search and content summarization tool built with Python, leveraging Groq's LLM API for advanced natural languag…☆148Updated 9 months ago
- All About AI MCP Servers☆79Updated last year
- LLM Use Case: LLM Powered, Reusable, Domain Agnostic Autocompletes☆67Updated last year
- Insanely Fast Transcription: A Python-based utility for rapid audio transcription from YouTube videos or local files. Leverages GPU accel…☆93Updated last year
- ☆82Updated 11 months ago
- ☆100Updated last year
- Full-stack web app that generates legal documents with AI☆150Updated 4 months ago
- Look, Anthropic's Claude-3.7-Sonnet is a powerful, hybrid CRASHOUT LLM. Let's understand this monumental release.☆83Updated 9 months ago
- Engineer your reusable, customizable, prompt library in Marimo reactive notebooks☆227Updated last year
- CrewAI agents that gather and analyze YouTube comments to generate insights to inform better content creation.☆64Updated last year
- ☆88Updated 11 months ago
- Tutorials☆112Updated 8 months ago
- Agency Swarm Railway Deployment Template☆53Updated 6 months ago
- A simple script that can run in the background, uses the whisper API to transcribe text into ANY application☆96Updated last year