ArtifexSoftware / pdf2docx
Open source Python library for converting PDF to DOCX.
☆2,610Updated last month
Related projects ⓘ
Alternatives and complementary repositories for pdf2docx
- A Python library for reading and writing PDF, powered by QPDF☆2,186Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆5,677Updated this week
- Demos, examples and utilities using PyMuPDF☆578Updated 4 months ago
- A Python library to extract tabular data from PDFs☆3,023Updated 3 months ago
- Create and modify Word documents with Python☆4,641Updated 3 months ago
- Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.☆6,749Updated last week
- Community maintained fork of pdfminer - we fathom PDF☆5,961Updated 3 months ago
- 文本盲水印:把信息隐匿到文本中,put invisible blind watermark into a text.☆1,412Updated this week
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆14,166Updated this week
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,642Updated 3 months ago
- CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包☆683Updated 4 months ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.☆3,067Updated this week
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆8,388Updated this week
- 《PDF 解析》☆977Updated 3 months ago
- mupdf mirror☆1,564Updated this week
- Your most handy video processing software☆2,599Updated last year
- Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!☆5,933Updated 4 months ago
- Tesseract documentation☆1,837Updated this week
- An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown for…☆1,947Updated this week
- Collaboration with wangxupeng(https://github.com/wangxupeng)☆1,822Updated 2 months ago
- 截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the scree…☆4,952Updated this week
- A Repo For Document AI☆2,591Updated this week
- Download Poppler binaries packaged for Windows with dependencies☆577Updated last month
- ☆585Updated 3 weeks ago
- Make awesome display tables using Python.☆1,891Updated this week
- Simple PDF text extraction☆870Updated last month
- A highly extensible Markdown editor. Version control, AI Copilot, mind map, documents encryption, code snippet running, integrated termin…☆5,645Updated this week
- 这是一个可以识别视频语音自动生成字幕SRT文件的开源 Windows-GUI 软件工具。☆4,732Updated last year
- A Python wrapper for Google Tesseract☆5,868Updated 3 weeks ago
- Create animated bar chart races in Python with matplotlib☆1,364Updated 4 months ago