wisupai / e2mLinks
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution.
☆1,234Updated last year
Alternatives and similar repositories for e2m
Users that are interested in e2m are comparing it to the libraries listed below
Sorting:
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…☆2,304Updated 2 months ago
- A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具☆1,592Updated 2 weeks ago
- Portable KMS (knowledge management system) designed to integrate seamlessly with any Retrieval-Augmented Generation (RAG) system☆1,365Updated 2 months ago
- Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.☆906Updated 11 months ago
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆281Updated 3 months ago
- OpenAI DeepResearch alternative, An AI-driven research system that performs comprehensive, iterative research on any topic using multiple…☆633Updated 4 months ago
- This React component is used to render Markdown into a beautiful poster image, with support for copying as an image. Md to Poster/Image/Q…☆1,773Updated 7 months ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,549Updated 8 months ago
- MemFree - Hybrid AI Search Engine & AI Page Generator☆1,444Updated 2 months ago
- E2M API, converting everything to markdown (LLM-friendly Format).☆137Updated 9 months ago
- ☆493Updated 6 months ago
- Detect and extract tables to markdown and csv☆752Updated 8 months ago
- Parse PDFs into markdown using Vision LLMs☆429Updated 3 weeks ago
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆637Updated 7 months ago
- ☆488Updated last week
- (Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, …☆2,070Updated last month
- ☆543Updated last year
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆499Updated 4 months ago
- AI Powered Knowledge Graph Generator☆1,302Updated last week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,893Updated 3 weeks ago
- moffee: Make Markdown Ready to Present☆1,272Updated 2 months ago
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, …☆483Updated this week
- ContextGem: Effortless LLM extraction from documents☆1,516Updated last week
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆320Updated 8 months ago
- PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation☆2,051Updated last month
- RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.☆2,625Updated last month
- PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]☆2,053Updated last week
- A MCP (Model Context Protocol) server for PowerPoint manipulation using python-pptx. This server provides tools for creating, editing, an…☆1,068Updated 2 months ago
- Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for ch…☆2,203Updated 2 weeks ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,756Updated last month