robert-mcdermott / doc2mdLinks
A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.
☆22Updated 4 months ago
Alternatives and similar repositories for doc2md
Users that are interested in doc2md are comparing it to the libraries listed below
Sorting:
- R MCP Server☆195Updated 2 weeks ago
- Git Based Memory Storage for Conversational AI Agent☆767Updated last month
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆445Updated this week
- Byte-Vision is a privacy-first document intelligence platform that transforms static documents into an interactive, searchable knowledge …☆71Updated last month
- Chat with your data - with memory, rules, and observability built in. Deploy in 2 minutes☆398Updated this week
- Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes☆727Updated this week
- Stop using static chunk sizes. A lightweight, production-ready RAG ingestion toolkit. Uses Docling for layout-aware parsing and applies s…☆64Updated last month
- LLM Client, Server API and UI☆405Updated this week
- This is a framework that implements various parallel reasoning strategies from the literature☆274Updated 3 weeks ago
- Local coding agent with neat UI☆337Updated 7 months ago
- This methodology provides a structured approach for collaborating with AI systems on software development projects. It addresses common i…☆389Updated last month
- state of the art browsing agent (WebArena 72.7%)☆361Updated 3 months ago
- CleverBee - The Open Source Deep Researcher Tool☆309Updated 7 months ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆447Updated last month
- Pixelagent — Multimodal stateful agents☆224Updated 7 months ago
- 🤖🕰️ An MCP server that gives language models temporal awareness and time calculation abilities. Teaching AI the significance of the pas…☆709Updated 6 months ago
- Async transport layers for MCP☆107Updated 4 months ago
- ☆120Updated 5 months ago
- Fast Diversification for Search & Retrieval☆461Updated last month
- Deep research tool for local knowledge base.☆151Updated last month
- An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questi…☆294Updated this week
- A Python package for zero-shot text anonymization using Transformer-based NER models.☆82Updated 3 weeks ago
- Generates breakthrough ideas from a single prompt through an 8 stage walkthrough, with optional research proposal paper.☆59Updated 3 months ago
- Turn AI into a persistent, memory-powered collaborator. Universal MCP Server (supports HTTP, STDIO, and WebSocket) enabling cross-platfor…☆243Updated 2 weeks ago
- Transcribe PDFs with local LLMs☆817Updated 3 weeks ago
- Claude consciousness project files☆31Updated last year
- ☆57Updated this week
- Turn any website or doc into an MCP server☆166Updated 3 weeks ago
- A Model Context Protocol (MCP) server that provides tools for interacting with JMAP (JSON Meta Application Protocol) email servers. Built…☆159Updated this week
- Declarative language for composable Al workflows. Devtool for agents and mere humans.☆599Updated this week