thiswillbeyourgithub / wdocLinks
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc
β477Updated 3 weeks ago
Alternatives and similar repositories for wdoc
Users that are interested in wdoc are comparing it to the libraries listed below
Sorting:
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β665Updated 3 months ago
- π discover story relationshipsβ337Updated 2 months ago
- β862Updated 3 months ago
- Parse PDFs into markdown using Vision LLMsβ417Updated 6 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,β¦β399Updated 3 weeks ago
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , crβ¦β412Updated 3 months ago
- π° Building News Agents to Summarize News with MCP, Q, and tmuxβ298Updated last month
- Local Video-LLM powered AI Baby Monitorβ429Updated 3 months ago
- Turn local files into a prompt for an LLMβ176Updated 7 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.β524Updated last week
- The open-source RAG platformβ274Updated this week
- OpenAI DeepResearch alternative, An AI-driven research system that performs comprehensive, iterative research on any topic using multipleβ¦β626Updated 2 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Baβ¦β304Updated 8 months ago
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.β834Updated 7 months ago
- MAESTRO is an AI-powered research application designed to streamline complex research tasks.β967Updated last week
- Excalidraw meets ComfyUI for LLMsβ276Updated last week
- β166Updated 9 months ago
- This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, Searβ¦β224Updated 8 months ago
- A MCP server implementation for hyperbrowserβ588Updated 3 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β867Updated 11 months ago
- β442Updated 11 months ago
- No-code ETL and data pipelines with AI and NLPβ315Updated 6 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine whatβ¦β318Updated 6 months ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.β277Updated 3 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.β125Updated 10 months ago
- Portable KMS (knowledge management system) designed to integrate seamlessly with any Retrieval-Augmented Generation (RAG) systemβ1,350Updated 3 weeks ago
- MCP server for fetch web page content using Playwright headless browser.β838Updated 2 months ago
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + moreβ314Updated 3 weeks ago
- β77Updated 4 months ago
- https://no-ocr.com/aboutβ165Updated 2 months ago