robert-mcdermott / doc2mdLinks
A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.
☆22Updated 3 months ago
Alternatives and similar repositories for doc2md
Users that are interested in doc2md are comparing it to the libraries listed below
Sorting:
- Git Based Memory Storage for Conversational AI Agent☆757Updated last month
- R MCP Server☆193Updated last week
- state of the art browsing agent (WebArena 72.7%)☆360Updated 2 months ago
- Async transport layers for MCP☆107Updated 4 months ago
- Pixelagent — Multimodal stateful agents☆223Updated 6 months ago
- Deploy an AI Analyst in less than 2 mins — connect any LLM to any data source with centralized context management, observability, and con…☆324Updated last week
- This methodology provides a structured approach for collaborating with AI systems on software development projects. It addresses common i…☆386Updated last week
- 🤖🕰️ An MCP server that gives language models temporal awareness and time calculation abilities. Teaching AI the significance of the pas…☆708Updated 6 months ago
- Stop using static chunk sizes. A lightweight, production-ready RAG ingestion toolkit. Uses Docling for layout-aware parsing and applies s…☆60Updated 3 weeks ago
- Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes☆700Updated this week
- This is a framework that implements various parallel reasoning strategies from the literature☆273Updated this week
- A Python toolkit for chain-of-thought prompting 🐍☆180Updated last week
- Local coding agent with neat UI☆335Updated 7 months ago
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆633Updated 2 weeks ago
- EnrichMCP is a python framework for building data driven MCP servers☆631Updated last week
- A Model Context Protocol (MCP) server that provides tools for interacting with JMAP (JSON Meta Application Protocol) email servers. Built…☆157Updated 4 months ago
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆743Updated 2 months ago
- A comprehensive Model Context Protocol (MCP) server implementing the latest specification.☆335Updated 6 months ago
- CleverBee - The Open Source Deep Researcher Tool☆308Updated 6 months ago
- Declarative language for composable Al workflows. Devtool for agents and mere humans.☆598Updated this week
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆446Updated 3 weeks ago
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆428Updated this week
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI☆510Updated 2 months ago
- Spegel - Reflect the web through AI☆330Updated 5 months ago
- ☆62Updated 9 months ago
- Build an AI Telephony Agent for Inbound and Outbound Calls☆225Updated 3 months ago
- Transcribe PDFs with local LLMs☆808Updated this week
- ☆675Updated 4 months ago
- A comprehensive suite of tools, built to liberate science by making the creation, evaluation, and dissemination of research more transpar…☆227Updated 4 months ago
- An open-source coding helper. Very friendly!☆812Updated this week