wisupai / e2mLinks

E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution.

☆1,093

Alternatives and similar repositories for e2m

Users that are interested in e2m are comparing it to the libraries listed below

Sorting:

chatdoc-com / OCRFlux
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…
☆1,767Updated last week
adithya-s-k / marker-api
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
☆877Updated 9 months ago
memfreeme / memfree
MemFree - Hybrid AI Search Engine & AI Page Generator
☆1,407Updated 2 weeks ago
Jing-yilin / E2M
E2M API, converting everything to markdown (LLM-friendly Format).
☆135Updated 7 months ago
GitHamza0206 / simba
Portable KMS (knowledge management system) designed to integrate seamlessly with any Retrieval-Augmented Generation (RAG) system
☆1,319Updated 2 weeks ago
opendatalab / magic-html
☆477Updated 4 months ago
opendatalab / magic-doc
☆529Updated 11 months ago
jolovicdev / shandu
OpenAI DeepResearch alternative, An AI-driven research system that performs comprehensive, iterative research on any topic using multiple…
☆618Updated last month
NanoNets / docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
☆1,511Updated 2 weeks ago
iamarunbrahma / vision-parse
Parse PDFs into markdown using Vision LLMs
☆395Updated 5 months ago
NoEdgeAI / pdfdeal
A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装，同时附带本地的文本处…
☆276Updated last month
johnson7788 / MultiAgentPPT
MultiAgentPPT 是一个集成了 A2A（Agent2Agent）+ MCP（Model Context Protocol）+ ADK（Agent Development Kit）架构的智能化演示文稿生成系统，支持通过多智能体协作和流式并发机制
☆842Updated this week
icip-cas / PPTAgent
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides
☆1,731Updated 2 weeks ago
rag-web-ui / rag-web-ui
RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.
☆2,515Updated 2 months ago
huridocs / pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…
☆627Updated last month
AnotiaWang / deep-research-web-ui
(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, …
☆1,962Updated 3 weeks ago
robert-mcdermott / ai-knowledge-graph
AI Powered Knowledge Graph Generator
☆1,045Updated 3 weeks ago
memodb-io / memobase
Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for ch…
☆1,622Updated this week
win4r / GraphRAG4OpenWebUI
GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combin…
☆530Updated 6 months ago
VikParuchuri / tabled
Detect and extract tables to markdown and csv
☆749Updated 5 months ago
ZongqianLi / ReasonGraph
[ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths
☆493Updated last month
GongRzhe / Office-PowerPoint-MCP-Server
A MCP (Model Context Protocol) server for PowerPoint manipulation using python-pptx. This server provides tools for creating, editing, an…
☆594Updated 3 weeks ago
microsoft / PIKE-RAG
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation
☆1,860Updated 2 months ago
cxcscmu / Craw4LLM
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
☆633Updated 4 months ago
echohive42 / AI-reads-books-page-by-page
AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…
☆1,501Updated 5 months ago
liuhuapiaoyuan / MinerU-webui
MinerU是一款开源的高质量PDF解析工具，基于深度学习技术，可自动提取PDF文档中的文字、表格、图片、公式等内容，并提供丰富的分析、统计、搜索等功能。本项目为其提供一个简化版本的WebUI，方便用户上传PDF文件，并实时展示提取结果。
☆229Updated 7 months ago
zjunlp / OmniThink
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
☆454Updated 2 months ago
gcui-art / markdown-to-image
This React component is used to render Markdown into a beautiful poster image, with support for copying as an image. Md to Poster/Image/Q…
☆1,517Updated 4 months ago
MarkPDFdown / markpdfdown
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
☆891Updated last month
BMPixel / moffee
moffee: Make Markdown Ready to Present
☆1,222Updated last month