thiswillbeyourgithub / wdocLinks
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc
☆499Updated last week
Alternatives and similar repositories for wdoc
Users that are interested in wdoc are comparing it to the libraries listed below
Sorting:
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆680Updated 7 months ago
- Parse PDFs into markdown using Vision LLMs☆453Updated 2 months ago
- Local Video-LLM powered AI Baby Monitor☆457Updated 7 months ago
- A python script designed to translate large amounts of text with an LLM, Ollama, OpenAI, Gemini and OpenRouter API☆449Updated last week
- 📚 discover story relationships☆346Updated 6 months ago
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , cr…☆413Updated 7 months ago
- ☆880Updated 7 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆412Updated 4 months ago
- Turn local files into a prompt for an LLM☆177Updated 11 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆540Updated last month
- ☆80Updated 8 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆319Updated last year
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆141Updated 3 months ago
- ☆170Updated last year
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic f…☆877Updated last month
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux