E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution.
☆1,294Sep 8, 2024Updated last year
Alternatives and similar repositories for e2m
Users that are interested in e2m are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- E2M API, converting everything to markdown (LLM-friendly Format).☆141Dec 12, 2024Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,104Dec 8, 2025Updated 6 months ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆7,543Dec 12, 2025Updated 6 months ago
- Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.☆67,596Updated this week
- Using GPT to parse PDF☆3,554Apr 17, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆2,150Jan 20, 2025Updated last year
- The first open-source agent skills builder. Define skills by vibe workflow, run on Claude Code, Cursor, Codex & more. Build Clawdbot 🦞· …☆7,362Mar 25, 2026Updated 2 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆36,101Jun 6, 2026Updated last week
- Company Researcher tool helps you instantly understand any company inside out.☆1,460Apr 8, 2026Updated 2 months ago
- OCR & Document Extraction using vision models☆12,242May 20, 2025Updated last year
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,387Feb 21, 2025Updated last year
- Python tool for converting files and office documents to Markdown.☆152,866May 26, 2026Updated 2 weeks ago
- MemFree - Hybrid AI Search Engine & AI Page Generator☆1,499Aug 8, 2025Updated 10 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆490Feb 6, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,715Jan 3, 2025Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆20,780Updated this week
- OpenSource Production ready Customer service with built in Evals and monitoring☆1,451Jan 12, 2026Updated 5 months ago
- RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.☆3,036Apr 6, 2026Updated 2 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,874Oct 13, 2025Updated 8 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆17,387Mar 25, 2026Updated 2 months ago
- Turn local files into a prompt for an LLM☆175Jan 19, 2025Updated last year
- ☆2,289Mar 17, 2025Updated last year
- This React component is used to render Markdown into a beautiful poster image, with support for copying as an image. Md to Poster/Image/Q…☆1,941Mar 5, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆11,175May 22, 2026Updated 3 weeks ago
- Fetch an entire site and save it as a text file (to be used with AI models).☆1,734Jan 18, 2025Updated last year
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆82,621Updated this week
- An open-source RAG-based tool for chatting with your documents.☆25,467Jun 9, 2026Updated last week
- WhyHow Knowledge Graph Studio☆924Dec 25, 2024Updated last year
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit…☆7,165Jun 9, 2026Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,947Apr 9, 2026Updated 2 months ago
- 🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。☆21,195Jun 9, 2026Updated last week
- Detect and extract tables to markdown and csv☆752Jan 24, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,139Feb 10, 2025Updated last year
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,869Jul 4, 2025Updated 11 months ago
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,…☆34,707May 25, 2026Updated 3 weeks ago
- Get your documents ready for gen AI☆61,291Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆28,339Updated this week
- A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具☆1,758Jan 25, 2026Updated 4 months ago
- AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs☆47,145Updated this week