thiswillbeyourgithub / wdoc
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIP
β451Updated this week
Alternatives and similar repositories for wdoc:
Users that are interested in wdoc are comparing it to the libraries listed below
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β621Updated this week
- β826Updated this week
- π discover story relationshipsβ321Updated last week
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , crβ¦β397Updated 3 weeks ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β852Updated 7 months ago
- Turn local files into a prompt for an LLMβ171Updated 3 months ago
- Parse PDFs into markdown using Vision LLMsβ356Updated 2 months ago
- OpenAI DeepResearch alternative, An AI-driven research system that performs comprehensive, iterative research on any topic using multipleβ¦β589Updated this week
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems fβ¦β979Updated last month
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercelβ¦β117Updated 2 months ago
- β438Updated 7 months ago
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.β769Updated 3 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Baβ¦β274Updated 4 months ago
- Fetch arxiv data to LLM-friendly textβ116Updated 2 months ago
- Self-hosted voice chat with LLMsβ427Updated 2 months ago
- MCP server for fetch web page content using Playwright headless browser.β658Updated this week
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,β¦β354Updated 3 weeks ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine whatβ¦β310Updated 2 months ago
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.β461Updated this week
- Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workersβ278Updated last month
- An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework.β¦β285Updated this week
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)β648Updated last month
- Excalidraw meets ComfyUI for LLMsβ253Updated last week