luisleo526 / doc2markLinks
AI-powered Python library that converts any document (PDF, Word, Excel, PowerPoint, HTML) to clean Markdown while preserving complex tables and layouts using AI-Powered OCR technology.
☆46Updated 2 months ago
Alternatives and similar repositories for doc2mark
Users that are interested in doc2mark are comparing it to the libraries listed below
Sorting:
- Fast local speech-to-text for any app using faster-whisper☆147Updated this week
- WebRAgent is a retrieval-augmented generation (RAG) web application featuring agent-based query decomposition, vector search with Qdrant,…☆54Updated 10 months ago
- Retrieval-augmented generation (RAG) for remote & local LLM use☆44Updated 8 months ago
- AI Document Assistant☆89Updated 8 months ago
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning w…☆78Updated 5 months ago
- Laddr is a python framework for building multi-agent systems where agents communicate, delegate tasks, and execute work in parallel. Thin…☆337Updated 2 months ago
- Make your meetings accessible to AI Agents☆432Updated 2 months ago
- LLM search engine faster than perplexity!☆376Updated 5 months ago
- This is a Python package to add tool calling capabilities to newly released LLMs on LangChain's ChatOpenAI, AzureAIChatCompletionsModel a…☆121Updated 8 months ago
- A MCP server allowing LLM agents to easily connect and retrieve data from any database☆99Updated 6 months ago
- One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs, RAG pipelines, and beyond.☆60Updated this week
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆167Updated 5 months ago
- VeritasGraph: Enterprise-Grade Graph RAG for Secure, On-Premise AI with Verifiable Attribution☆237Updated last week
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆147Updated 8 months ago
- ☆59Updated last week
- Your SDK solves all of this. One interface. Unified logic. Local + hosted models. Fine-tuning. Agent tools. Enterprise-ready. Hybrid RAG.…☆86Updated last week
- Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web …☆136Updated last month
- A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit.☆45Updated 2 weeks ago
- PDFStract - The Extraction and Chunking Layer in Your RAG Pipeline - Available as CLI - WEBUI - API☆113Updated 2 weeks ago
- pdfLLM is a completely open source, proof of concept RAG app.☆184Updated 5 months ago
- Chat with PDF files with source highlights☆149Updated last year
- HippocampAI — Autonomous Memory Engine for LLM Agents☆31Updated 2 weeks ago
- Find your files with natural language and ask questions.☆57Updated last week
- ☆106Updated 8 months ago
- ☆62Updated last month
- A fully custom chatbot built with Agentic RAG (Retrieval-Augmented Generation), combining Gemini models with a local knowledge base for a…☆164Updated 11 months ago
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆227Updated 3 months ago
- The AI runtime that turns your framework functions into OpenAI compatible endpoints☆88Updated 11 months ago
- An MCP server that executes Python code in isolated rootless containers with optional MCP server proxying. Implementation of Anthropic's …☆306Updated 2 months ago
- A Python CLI to test, benchmark, and find the best RAG chunking strategy for your Markdown documents.☆101Updated 3 weeks ago