lxulxu / pdf-to-markdown
通过paddle ocr实现pdf转markdown
☆56Updated last month
Related projects ⓘ
Alternatives and complementary repositories for pdf-to-markdown
- LLama3中文个人版本☆40Updated 6 months ago
- Using GPT to parse PDF☆68Updated 2 months ago
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆197Updated this week
- ☆100Updated 3 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆150Updated 2 weeks ago
- Based on RapidOCR, extract the PDF content.☆133Updated 2 months ago
- ☆68Updated 10 months ago
- bisheng-unstructured library☆36Updated this week
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆58Updated last week
- gpt_server是一个用于生产级部署LLMs或Embedding的开源框架。☆120Updated this week
- Analysis of Chinese and English layouts 中英文版面分析☆126Updated last month
- AGI模块库架构图☆75Updated last year
- doc2x docs☆30Updated this week
- dify's rag patch module☆45Updated this week
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆241Updated 2 months ago
- A collection of RAG systems powered by LLM.☆137Updated last week
- 源自PP-Structure的表格识别算法,模型转换为ONNX,推理引擎采用ONNXRuntime,部署简单,无内存泄露问题。☆70Updated last week
- ☆60Updated 2 months ago
- Convert different model APIs into the OpenAI API format out of the box.☆145Updated 9 months ago
- aigc_serving lightweight and efficient Language service model reasoning☆23Updated 5 months ago
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆93Updated 2 months ago
- 与 https://github.com/tonori/mem0ai-api 配合使用的非官方的 mem0ai provider.☆38Updated 3 months ago
- A Python Package to Access World-Class Generative Models☆125Updated 5 months ago
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆122Updated 5 months ago
- Conversational Retrieval Evaluation Dataset☆91Updated last month
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式☆84Updated last week
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated last year
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆84Updated last year
- ☆39Updated last month