robert-mcdermott / doc2mdLinks
A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.
☆20Updated 3 months ago
Alternatives and similar repositories for doc2md
Users that are interested in doc2md are comparing it to the libraries listed below
Sorting:
- R MCP Server☆191Updated last week
- Deploy an AI Analyst in less than 2 mins — connect any LLM to any data source with centralized context management, observability, and con…☆282Updated last week
- Git Based Memory Storage for Conversational AI Agent☆737Updated last week
- Pixelagent — Multimodal stateful agents☆223Updated 5 months ago
- state of the art browsing agent (WebArena 72.7%)☆358Updated 2 months ago
- CleverBee - The Open Source Deep Researcher Tool☆305Updated 5 months ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆445Updated last week
- A Model Context Protocol (MCP) server that provides tools for interacting with JMAP (JSON Meta Application Protocol) email servers. Built…☆152Updated 3 months ago
- Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes☆650Updated this week
- Async transport layers for MCP☆107Updated 3 months ago
- A Python package for zero-shot text anonymization using Transformer-based NER models.☆78Updated this week
- Byte-Vision is a privacy-first document intelligence platform that transforms static documents into an interactive, searchable knowledge …☆63Updated 2 months ago
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆395Updated this week
- A comprehensive suite of tools, built to liberate science by making the creation, evaluation, and dissemination of research more transpar…☆226Updated 3 months ago
- A Python toolkit for chain-of-thought prompting 🐍☆178Updated 3 months ago
- Local coding agent with neat UI☆329Updated 6 months ago
- Parallel thinking for LLMs. Confidence‑gated, strategy‑driven, offline‑friendly☆258Updated 2 months ago
- Content addressable storage with excellent search☆356Updated last week
- Declarative language for composable Al workflows. Devtool for agents and mere humans.☆586Updated this week
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆617Updated this week
- This methodology provides a structured approach for collaborating with AI systems on software development projects. It addresses common i…☆379Updated 2 months ago
- Deep research tool for local knowledge base.☆143Updated last month
- 🤖 An open-source AI assistant answering questions using your docs☆230Updated 2 months ago
- An open-source coding helper. Very friendly!☆787Updated this week
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆269Updated last month
- ☆117Updated 4 months ago
- Fast Diversification for Search & Retrieval☆428Updated 2 weeks ago
- Open-source AI-powered data science platform.☆284Updated last month
- ☆62Updated 8 months ago
- Easily copy all relevant source files in a repository to clipboard. For use in LLM code understanding and generation workflows☆229Updated 9 months ago