thiswillbeyourgithub / wdocLinks
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc
☆504Updated last week
Alternatives and similar repositories for wdoc
Users that are interested in wdoc are comparing it to the libraries listed below
Sorting:
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆681Updated 8 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆414Updated 5 months ago
- 📚 discover story relationships☆347Updated 6 months ago
- Local Video-LLM powered AI Baby Monitor☆465Updated 7 months ago
- ☆885Updated 8 months ago
- Parse PDFs into markdown using Vision LLMs☆455Updated 3 months ago
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic f…☆878Updated last month
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , cr…☆411Updated 8 months ago
- Turn local files into a prompt for an LLM☆177Updated last year
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux☆309Updated 6 months ago
- ☆80Updated 9 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆318Updated last year
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆540Updated 2 months ago
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆851Updated 2 months ago
- A python script designed to translate large amounts of text with an LLM, Ollama, OpenAI, Gemini and OpenRouter API☆456Updated this week
- Excalidraw meets ComfyUI for LLMs☆307Updated 4 months ago
- ☆170Updated last year
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆141Updated 4 months ago
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆384Updated 6 months ago
- ☆285Updated 11 months ago
- The simplest open-source implementation of perplexity.ai☆324Updated 11 months ago
- ☆263Updated last year
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆293Updated 8 months ago
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + more☆371Updated 2 weeks ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆327Updated 11 months ago
- https://no-ocr.com/about☆175Updated 6 months ago
- ☆441Updated last year
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…☆1,088Updated 5 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆294Updated last year
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆124Updated 2 months ago