wisupai / e2mLinks
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution.
☆1,246Updated last year
Alternatives and similar repositories for e2m
Users that are interested in e2m are comparing it to the libraries listed below
Sorting:
- Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.☆931Updated last year
- OpenSource Production ready Customer service with built in Evals and monitoring☆1,388Updated this week
- A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具☆1,628Updated 3 months ago
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…☆2,413Updated 5 months ago
- MemFree - Hybrid AI Search Engine & AI Page Generator☆1,473Updated 5 months ago
- E2M API, converting everything to markdown (LLM-friendly Format).☆138Updated last year
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆283Updated 6 months ago
- Detect and extract tables to markdown and csv☆756Updated 11 months ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,567Updated 11 months ago
- Parse PDFs into markdown using Vision LLMs☆455Updated 3 months ago
- AI Powered Knowledge Graph Generator☆1,420Updated last week
- This React component is used to render Markdown into a beautiful poster image, with support for copying as an image. Md to Poster/Image/Q…☆1,795Updated 10 months ago
- RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.☆2,725Updated last month
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆1,059Updated this week
- ☆515Updated 9 months ago
- ☆817Updated 2 months ago
- ☆547Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,963Updated last month
- moffee: Make Markdown Ready to Present☆1,305Updated 5 months ago
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆650Updated 10 months ago
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆506Updated 7 months ago
- Prompt optimization scratch☆881Updated 8 months ago
- An experimental UI for text-to-knowledge-graph generation☆780Updated last year
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,828Updated 4 months ago
- (Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, …☆2,134Updated 4 months ago
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆485Updated 4 months ago
- ContextGem: Effortless LLM extraction from documents☆1,750Updated 3 weeks ago
- Using GPT to parse PDF☆3,560Updated 8 months ago
- A passive recording project allows you to have complete control over your data. Automatically take screenshots of all your screens, index…☆1,337Updated 2 weeks ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,931Updated 3 months ago