hrushikeshrv / docxlatexLinks
A python library for extracting equations, text, and images from .docx files
☆14Updated last month
Alternatives and similar repositories for docxlatex
Users that are interested in docxlatex are comparing it to the libraries listed below
Sorting:
- convert equation inside word(.docx) to latex☆23Updated 2 years ago
- 中文论文、证券类、财报类PDF数据☆32Updated last year
- 厦门理工模式识别团队通用python代码工具库☆14Updated last month
- 检测和提取各种场景图片中的表格区域,并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation i…☆98Updated 6 months ago
- A handbook for mathematicians who want to get productive using GitHub☆10Updated 4 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆218Updated last week
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆359Updated last week
- 文档方向分类☆219Updated 7 months ago
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆58Updated 10 months ago
- Another LaTex equation OCR tool based on ConvNeXt and Transformer☆50Updated last year
- This repository serves as a comprehensive reference for both beginners and advanced users of Git. It provides an organized and easy-to-fo…☆11Updated 7 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆252Updated 2 weeks ago
- End-to-end model training and deployment reference for handwritten Chinese text recognition, and can also be extended to other languages.☆166Updated 2 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆314Updated this week
- Lecture notes, scripts, and material for the lecture of Selected Statistics Topics in the Autonomous University of Querétaro☆12Updated 7 months ago
- A text editor programmed with Python and PyQt6 with integration to Microsoft Word and Upload-System to Github.☆9Updated 2 months ago
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆354Updated 7 months ago
- 阅读顺序、Layoutreader☆16Updated last month
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆225Updated 2 months ago
- llms related stuff , including code, docs☆13Updated 4 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆290Updated 9 months ago
- 一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.☆121Updated 3 years ago
- Add Any Formula to DOCX (LaTex/MathML/OMML)☆11Updated 4 years ago
- A simple OCR preprocessing tool using Python with a GUI.☆31Updated 2 years ago
- ☆58Updated 3 years ago
- 解析word 中的数学公式mathtype对象为latex字符串☆34Updated 5 years ago
- Handwritten mathematical symbols recognition with TrOCR☆18Updated last year
- PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取☆202Updated last year
- ocr,pdf转docx,pdf to docx☆21Updated 2 years ago
- LaTeX OCR 的数据仓库☆124Updated last year