kidpeterpan / gemini-document-processorLinks
A powerful document processing tool that uses Google's Gemini AI to generate high-quality Thai language summaries from PDF and EPUB files, with image extraction and Obsidian integration.
☆24Updated 7 months ago
Alternatives and similar repositories for gemini-document-processor
Users that are interested in gemini-document-processor are comparing it to the libraries listed below
Sorting:
- iauto is a low-code engine for building and deploying AI agents☆91Updated last year
- ☆30Updated last year
- Chat strategies for LLMs☆125Updated this week
- Supercompat allows you to use any AI provider like Anthropic, Groq or Mistral with OpenAI-compatible Assistants API.☆92Updated 3 weeks ago
- Flowchart-like UI to interconnect LLM's and Huggingface models, and deploy them as a REST API with little to no code.☆71Updated 9 months ago
- OmiAI is an opinionated AI SDK for Typescript that auto-picks the best model from a suite of curated models depending on the prompt. It i…☆119Updated 5 months ago
- A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insigh…☆16Updated 2 years ago
- Light-weight control pane to run CLI coding agents(Claude Code, Codex) in parallel☆394Updated last month
- Add Siri like Native AI Agents in you App.☆54Updated 11 months ago
- GPT-4o-Realtime based AI Podcast Generator☆38Updated last year
- We handle what engineers and IDEs won't: generating and maintaining technical documentation for your codebase, while also providing searc…☆186Updated 2 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year
- Find the Best LLM for Your Needs through E2E Testing☆83Updated last year
- Autospec is an open-source AI agent that takes a web app URL and autonomously QAs it, and saves its passing specs as E2E test code☆56Updated 10 months ago
- Run Claude Code or OpenAI Codex in the background☆32Updated this week
- MidStream is a powerful platform that makes AI conversations smarter and more responsive. Instead of waiting for an AI to finish speaking…☆52Updated last month
- 💭 Chat with AI via API☆33Updated last year
- Your appetite for code + Claude's capabilities = Limitless creation. No experience required - just pure hunger! 🧠⚡💻☆58Updated 6 months ago
- A ChatGPT UI for young readers, written by ChatGPT☆69Updated 2 years ago
- Your AI research assistant☆79Updated 8 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆34Updated last year
- Galleries for Models, Datasets, and Plugins used by Transformer Lab☆27Updated this week
- Embedding models from Jina AI☆65Updated last year
- A task management system designed for AI development☆68Updated last week
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆31Updated 8 months ago
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source.☆32Updated last year
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m…☆25Updated 3 weeks ago
- Visual inference exploration & experimentation playground☆96Updated last year
- Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.☆23Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆35Updated last year