robert-mcdermott / doc2mdLinks
A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.
☆28Updated 5 months ago
Alternatives and similar repositories for doc2md
Users that are interested in doc2md are comparing it to the libraries listed below
Sorting:
- R MCP Server☆197Updated last month
- Git Based Memory Storage for Conversational AI Agent☆774Updated 2 weeks ago
- Local coding agent with neat UI☆344Updated 8 months ago
- Chat with your data - with memory, rules, and observability built in. Deploy in 2 minutes☆412Updated this week
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆482Updated this week
- Byte-Vision is a privacy-first document intelligence platform that transforms static documents into an interactive, searchable knowledge …☆71Updated 2 months ago
- Pixelagent — Multimodal stateful agents☆224Updated 8 months ago
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.☆666Updated 7 months ago
- state of the art browsing agent (WebArena 72.7%)☆365Updated 4 months ago
- Async transport layers for MCP☆108Updated 5 months ago
- Transcribe PDFs with local LLMs☆818Updated 2 weeks ago
- Stop using static chunk sizes. A lightweight, production-ready RAG ingestion toolkit. Uses Docling for layout-aware parsing and applies s…☆63Updated 2 months ago
- 🤖🕰️ An MCP server that gives language models temporal awareness and time calculation abilities. Teaching AI the significance of the pas…☆711Updated 7 months ago
- LLM Client, Server API and UI☆565Updated this week
- A Python toolkit for chain-of-thought prompting 🐍☆184Updated last month
- This methodology provides a structured approach for collaborating with AI systems on software development projects. It addresses common i…☆391Updated last month
- This is a framework that implements various parallel reasoning strategies from the literature☆275Updated last month
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆448Updated 2 months ago
- Dead Simple LLM Abliteration☆248Updated 11 months ago
- It takes a village to raise a child: Google DeepThink 🧠 but in LangGraph and free - an original algorithm for collaborative agents using…☆135Updated 3 weeks ago
- CleverBee - The Open Source Deep Researcher Tool☆310Updated last week
- ☆121Updated 6 months ago
- A Model Context Protocol (MCP) server that provides tools for interacting with JMAP (JSON Meta Application Protocol) email servers. Built…☆161Updated this week
- EnrichMCP is a python framework for building data driven MCP servers☆642Updated last month
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆746Updated 4 months ago
- A Python package for zero-shot text anonymization using Transformer-based NER models.☆81Updated last month
- An open-source coding helper. Very friendly!☆866Updated this week
- A structural code search engine for Al agents.☆532Updated 2 weeks ago
- Turn any website or doc into an MCP server☆172Updated last month
- Declarative language for composable Al workflows. Devtool for agents and mere humans.☆610Updated this week