lxulxu / pdf-to-markdownLinks

通过paddle ocr实现pdf转markdown

☆72

Alternatives and similar repositories for pdf-to-markdown

Users that are interested in pdf-to-markdown are comparing it to the libraries listed below

Sorting:

daodao97 / gptpdf-ui
Using GPT to parse PDF
☆100Updated 11 months ago
NoEdgeAI / pdfdeal
A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装，同时附带本地的文本处…
☆278Updated last month
neka-nat / mineru-api
MinerU API server
☆65Updated 7 months ago
NoEdgeAI / doc2x-doc
doc2x docs
☆68Updated 8 months ago
soulteary / amazing-openai-api
Convert different model APIs into the OpenAI API format out of the box.
☆157Updated last year
KylinMountain / markify
Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.
☆121Updated 4 months ago
dataelement / bisheng-unstructured
bisheng-unstructured library
☆55Updated 2 months ago
RapidAI / RapidDoc
📝 针对文档类图像做内容提取，将文档类图像一比一输出到Word或者Txt中，便于进一步使用或处理。后续计划支持输入PDF/图像，输出对应json格式、Txt格式、Word格式和Markdown格式。
☆201Updated 9 months ago
ck-unifr / pdf_parsing
PDF解析（文字，章节，表格，图片，参考），基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答，摘要，信息抽取
☆205Updated last year
shibing624 / agentica
Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。
☆197Updated 2 weeks ago
minghaochen / universal-prompt
optimize your prompt like promptperfect｜万能提示词｜大语言模型提示词优化
☆45Updated last year
Jing-yilin / E2M
E2M API, converting everything to markdown (LLM-friendly Format).
☆136Updated 7 months ago
RapidAI / RapidOCRPDF
Based on RapidOCR, extract the PDF content
☆179Updated 3 months ago
chrschy / fact-finder
☆112Updated last year
shibing624 / ChatPilot
ChatPilot: Chat Agent Web UI，实现Chat对话前端，支持Google搜索、文件网址对话（RAG）、代码解释器功能，复现了Kimi Chat(文件，拖进来；网址，发出来)。
☆575Updated last month
HuiMi24 / chatppt
ChatPPT is powered by chatgpt/ollama, it could help you to generate PPT/slide. It supports output in English and Chinese
☆289Updated 2 months ago
sugarforever / spark-api-gateway
☆68Updated last year
soulteary / ai-token-calculator
一个可以验证和计算文本消耗 Token 的小工具，支持在浏览器中使用，汉化自 OpenAI Tokenizer。
☆56Updated last year
CosmosShadow / GeneralAgent
A python native agent framework
☆453Updated 8 months ago
aidoczh / dspy-doc-zh
DSPy中文文档
☆33Updated last year
opendatalab / magic-html
☆482Updated 4 months ago
bravekingzhang / search-engine-tool
可能是免费中最好的搜索引擎API，支持Google，Bing，DuckDuckGo，Yahoo
☆129Updated 2 years ago
q2wxec / lang2openai
lang2openai
☆73Updated 9 months ago
RapidAI / RapidRAG
QA based on local knowledge and LLM.
☆231Updated 6 months ago
SUSTYuxiao / PdfTranslator
a useful PDF Translate tool base on LLM/ 一个基于大语言模型的PDF翻译程序
☆68Updated 11 months ago
happyapplehorse / agere
The tool is used for building and driving workflows specifically tailored for AI initiatives. It can be used to construct AI agents.
☆149Updated last year
RapidAI / RapidLayout
Analysis of Chinese and English layouts 中英文版面分析
☆235Updated this week
hustyichi / dify-eval
基于 Dify + Langfuse 的自动化评估服务
☆77Updated 2 months ago
opendatalab / magic-doc
☆538Updated last year
shell-nlp / gpt_server
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。
☆204Updated 2 weeks ago