wzlxjtu / PDF2LaTeX-datasetLinks
☆21Updated 5 years ago
Alternatives and similar repositories for PDF2LaTeX-dataset
Users that are interested in PDF2LaTeX-dataset are comparing it to the libraries listed below
Sorting:
- Train a neural network to produce latex source code which generates a given pdf file☆13Updated 8 years ago
- Scanning Single Shot Detector for Math in Document Images☆132Updated 2 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection☆69Updated 5 years ago
- Another LaTex equation OCR tool based on ConvNeXt and Transformer☆50Updated last year
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆157Updated 11 months ago
- Solution to im2latex request for research of openai☆90Updated last year
- A command line interface to download PDF files from https://arxiv.org.☆57Updated 2 weeks ago
- Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files.☆19Updated 3 years ago
- Official implementation for ICDAR 2021 best poster paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Tr…☆125Updated last year
- LaTeX OCR 的数据仓库☆129Updated last year
- Math formula recognition (Images to LaTeX strings)☆303Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆109Updated last year
- ☆45Updated 3 years ago
- Pytorch implemention of Deep CNN Encoder + LSTM Decoder with Attention for Image to Latex☆197Updated last year
- Large scale training of Latex formula recognition model, currently being organized and open source☆53Updated last year
- Question Answering dataset generator of Document Visual in English and Chinese☆25Updated 2 years ago
- ☆75Updated 4 years ago
- Fully automated end-to-end framework to extract data from bar plots and other figures in scientific research papers using modules such as…☆119Updated 4 years ago
- K12高中数学试题数据集☆13Updated 2 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Updated 4 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆149Updated 4 months ago
- A working Docker image for the Maxtract program that converts pdf to LaTeX sources☆14Updated 5 years ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated 2 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆199Updated 6 months ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆68Updated last year
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Updated 5 years ago
- 1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection(公式检测冠军方案)☆132Updated 2 years ago
- Table Structure Recognition☆76Updated 2 years ago
- https://dl.acm.org/doi/10.1145/3657281☆98Updated last year