wisupai / e2m
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It is easy to install, provides dedicated parsers and converters, and supports custom configuration. E2M offers an all-in-one, flexible, and open-source solution.
☆1,234Updated last year
Alternatives and similar repositories for e2m
Users who are interested in e2m are comparing it to the libraries listed below.
- A high-quality PDF to Markdown tool based on large language model visual recognition.☆1,603Updated last month
- Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.☆917Updated last year
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…☆2,341Updated 2 months ago
- Portable KMS (knowledge management system) designed to integrate seamlessly with any Retrieval-Augmented Generation (RAG) system☆1,372Updated 2 months ago
- ☆713Updated 2 weeks ago
- E2M API, converting everything to markdown (LLM-friendly Format).☆139Updated 10 months ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. The script performs an intelligent page-by-page analysis of PDF books, met…☆1,554Updated 9 months ago
- This React component is used to render Markdown into a beautiful poster image, with support for copying as an image. Md to Poster/Image/Q…☆1,777Updated 7 months ago
- A Python wrapper for the Doc2X API that comes with local text processing (to improve PDF recall in RAG).☆283Updated 4 months ago
- ☆492Updated 7 months ago
- MemFree - Hybrid AI Search Engine & AI Page Generator☆1,449Updated 2 months ago
- Parse PDFs into markdown using Vision LLMs☆439Updated 3 weeks ago
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆501Updated 5 months ago
- ☆543Updated last year
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,785Updated 2 months ago
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆639Updated 8 months ago
- Detect and extract tables to markdown and csv☆754Updated 9 months ago
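- MinerU is an open-source, high-quality PDF parsing tool based on deep learning that automatically extracts text, tables, images, formulas, and other content from PDF documents, and provides rich analysis, statistics, and search features. This project provides a simplified WebUI for it, letting users upload PDF files and view the extraction results in real time.☆271Updated 10 months ago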
- An MCP (Model Context Protocol) server for PowerPoint manipulation using python-pptx. This server provides tools for creating, editing, an…☆1,157Updated 2 weeks ago
- AI Powered Knowledge Graph Generator☆1,338Updated last month
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆464Updated 2 months ago
- PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]☆2,148Updated last week
- Adds a 🚀 streaming web server to GraphRAG, compatible with the OpenAI SDK, with support for accessible entity links 🔗, suggested questions, and local embedding models, plus numerous fixes.☆260Updated 7 months ago
- A quick vibe-coded app for DeepSeek OCR☆1,242Updated last week
- GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combin…☆563Updated 9 months ago
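- A desktop application based on MinerU. MinerU is an open-source, high-quality PDF parsing tool based on deep learning that automatically extracts text, tables, images, formulas, and other content from PDF documents, and provides rich analysis, statistics, and search features. This project provides a simplified WebUI for it, letting users upload PDF files and view the extraction results in real time.☆115Updated last year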
- moffee: Make Markdown Ready to Present☆1,287Updated 2 months ago
- ☆158Updated 4 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,904Updated last month
- UltraRAG 2.0: Less Code, Lower Barrier, Faster Deployment! MCP-based low-code RAG framework, enabling researchers to build complex pipeli…☆1,768Updated this week