zlqm / docx-equation
convert equation inside word(.docx) to latex
☆18Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for docx-equation
- Convert omml to latex for displaying in web browsers (KaTeX)☆21Updated 4 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆104Updated 5 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆206Updated last month
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆124Updated last month
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆74Updated last year
- Large scale training of Latex formula recognition model, currently being organized and open source☆44Updated 7 months ago
- TeX compilation service that makes use of arXiv.org's AutoTeX library.☆27Updated 5 months ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year
- Document Image Binarization☆73Updated last month
- 源自PP-Structure的表格识别算法,模型转换为ONNX,推理引擎采用ONNXRuntime,部署简单,无内存泄露问题。☆70Updated last week
- 文档方向分类☆202Updated 2 weeks ago
- ☆67Updated this week
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆150Updated 2 weeks ago
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 3 years ago
- 阅读顺序、Layoutreader☆10Updated 5 months ago
- ☆16Updated last year
- Datasets and Evaluation Scripts for CompHRDoc☆25Updated 7 months ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆138Updated 2 months ago
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆29Updated last year
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis☆69Updated last month
- transformer based OCR framework used to train OCR or image to latex☆9Updated 2 years ago
- 【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。☆107Updated 8 months ago
- Extracting LaTeX equations from PDF☆19Updated last year
- A curated list of resources dedicated to table recognition☆374Updated 9 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆274Updated last year
- Pure Python library for LaTeX to MathML conversion☆187Updated this week
- Dataset and scripts for HRDoc☆33Updated last year
- The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.☆76Updated 4 months ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆165Updated this week
- Integrate AI-powered Document Analysis Pipelines☆62Updated this week