thiswillbeyourgithub / wdoc
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc
☆468Updated last week
Alternatives and similar repositories for wdoc
Users that are interested in wdoc are comparing it to the libraries listed below
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆656Updated last month
- Parse PDFs into markdown using Vision LLMs☆395Updated 5 months ago
- Deeper Seeker is a simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT. It is an agentic research tool to reason, cr…☆409Updated 2 months ago
- Local Video-LLM powered AI Baby Monitor☆389Updated last month
- ☆848Updated 2 months ago
- Turn local files into a prompt for an LLM☆173Updated 5 months ago
- 📚 discover story relationships☆336Updated 3 weeks ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆388Updated last month
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆775Updated 6 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…☆864Updated 9 months ago
- OpenAI DeepResearch alternative: an AI-driven research system that performs comprehensive, iterative research on any topic using multiple…☆618Updated last month
- The open-source RAG platform☆236Updated 3 weeks ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆299Updated 6 months ago
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux☆281Updated 2 months ago
- ☆444Updated 9 months ago
- NativeMind: Your fully private, open-source, on-device AI assistant☆368Updated this week
- https://no-ocr.com/about☆162Updated 2 weeks ago
- ☆241Updated 8 months ago
- An MCP server implementation for hyperbrowser☆486Updated last month
- Yet another open source Perplexity☆448Updated 8 months ago
- Google's NotebookLM, but local☆315Updated 2 months ago
- Web scraper made with AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆521Updated last month
- This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, Sear…☆224Updated 6 months ago
- The simplest open-source implementation of perplexity.ai☆314Updated 5 months ago
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)☆663Updated 3 months ago
- MCP server for fetching web page content using a Playwright headless browser.☆760Updated 3 weeks ago
- Your first AI prompt engineer☆395Updated 2 weeks ago
- A CLI tool to provide LLM context for coding projects by combining project files into a single text file (or clipboard text) with directo…☆146Updated 2 months ago
- ☆77Updated 3 months ago
- 📥 cpdown - Copy any webpage content or YouTube subtitles to the clipboard as clean markdown with one click or shortcut☆314Updated this week