robert-mcdermott / doc2mdView on GitHub
A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.
30Aug 26, 2025Updated 6 months ago

Alternatives and similar repositories for doc2md

Users that are interested in doc2md are comparing it to the libraries listed below

Sorting:

Are these results useful?