E2M converts a range of file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It is easy to install, provides dedicated parsers and converters, and supports custom configuration. E2M is an all-in-one, flexible, open-source solution.
☆1,292 · Sep 8, 2024 · Updated last year
Alternatives and similar repositories for e2m
Users that are interested in e2m are comparing it to the libraries listed below.
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ☆3,097 · Dec 8, 2025 · Updated 4 months ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆6,815 · Dec 12, 2025 · Updated 4 months ago
- Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows. ☆61,724 · Apr 29, 2026 · Updated last week
- Using GPT to parse PDF ☆3,551 · Apr 17, 2025 · Updated last year
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met… ☆2,103 · Jan 20, 2025 · Updated last year
- The first open-source agent skills builder. Define skills by vibe workflow, run on Claude Code, Cursor, Codex & more. Build Clawdbot 🦞· … ☆7,261 · Mar 25, 2026 · Updated last month
- Convert PDF to markdown + JSON quickly with high accuracy ☆34,606 · Apr 24, 2026 · Updated last week
- Company Researcher tool helps you instantly understand any company inside out. ☆1,445 · Apr 8, 2026 · Updated 3 weeks ago
- OCR & Document Extraction using vision models ☆12,227 · May 20, 2025 · Updated 11 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,354 · Feb 21, 2025 · Updated last year
- Python tool for converting files and office documents to Markdown. ☆119,095 · Apr 20, 2026 · Updated 2 weeks ago
- MemFree - Hybrid AI Search Engine & AI Page Generator ☆1,498 · Aug 8, 2025 · Updated 8 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1 ☆489 · Feb 6, 2025 · Updated last year
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆9,642 · Jan 3, 2025 · Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,707 · Apr 24, 2026 · Updated last week
- OpenSource Production ready Customer service with built in Evals and monitoring ☆1,443 · Jan 12, 2026 · Updated 3 months ago
- RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology. ☆2,980 · Apr 6, 2026 · Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,846 · Oct 13, 2025 · Updated 6 months ago
- Toolkit for linearizing PDFs for LLM datasets/training ☆17,231 · Mar 25, 2026 · Updated last month
- Turn local files into a prompt for an LLM ☆176 · Jan 19, 2025 · Updated last year
- This React component is used to render Markdown into a beautiful poster image, with support for copying as an image. Md to Poster/Image/Q… ☆1,898 · Mar 5, 2025 · Updated last year
- ☆2,283 · Mar 17, 2025 · Updated last year
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆10,745 · Updated this week
- Fetch an entire site and save it as a text file (to be used with AI models). ☆1,735 · Jan 18, 2025 · Updated last year
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat… ☆79,332 · Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆25,350 · Apr 3, 2026 · Updated last month
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit… ☆6,947 · Apr 29, 2026 · Updated last week
- WhyHow Knowledge Graph Studio ☆916 · Dec 25, 2024 · Updated last year
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,942 · Apr 9, 2026 · Updated 3 weeks ago
- 🔥 MaxKB is an open-source platform for building enterprise-grade agents. A powerful, easy-to-use, open-source enterprise-grade agent platform. ☆20,871 · Apr 30, 2026 · Updated last week
- Detect and extract tables to markdown and csv ☆756 · Jan 24, 2025 · Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆8,116 · Feb 10, 2025 · Updated last year
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆6,848 · Jul 4, 2025 · Updated 10 months ago
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved, supporting Google/DeepL/Ollama/OpenAI and other services,… ☆33,492 · Apr 20, 2026 · Updated 2 weeks ago
- Get your documents ready for gen AI ☆59,087 · Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process… ☆27,894 · Apr 30, 2026 · Updated last week
- AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs ☆44,968 · Updated this week
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking ☆484 · Aug 23, 2025 · Updated 8 months ago
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆5,440 · Feb 25, 2026 · Updated 2 months ago