thiswillbeyourgithub / wdocLinks
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc
☆491Updated last week
Alternatives and similar repositories for wdoc
Users that are interested in wdoc are comparing it to the libraries listed below
Sorting:
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆680Updated 6 months ago
- A python script designed to translate large amounts of text with an LLM and the Ollama API☆433Updated last week
- ☆879Updated 6 months ago
- Parse PDFs into markdown using Vision LLMs☆449Updated 2 months ago
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , cr…☆411Updated 6 months ago
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic f…☆875Updated last week
- 📚 discover story relationships☆346Updated 5 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆412Updated 4 months ago
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆847Updated last month
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆140Updated 3 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆537Updated last month
- Yet another open source Perplexity☆460Updated last year
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux☆307Updated 4 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆317Updated 11 months ago
- This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, Sear…☆229Updated 11 months ago
- ☆447Updated last year
- Excalidraw meets ComfyUI for LLMs☆294Updated 3 months ago
- Local Video-LLM powered AI Baby Monitor☆454Updated 6 months ago
- Turn local files into a prompt for an LLM☆178Updated 10 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆125Updated last month
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆291Updated 6 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆502Updated 4 months ago
- ☆170Updated last year
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆292Updated 11 months ago
- ☆80Updated 7 months ago
- The simplest open-source implementation of perplexity.ai☆321Updated 10 months ago
- ☆241Updated 6 months ago
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…☆1,087Updated 4 months ago
- ☆260Updated last year
- https://no-ocr.com/about☆170Updated 5 months ago