wzlxjtu / PDF2LaTeX-dataset
☆21Updated 5 years ago
Alternatives and similar repositories for PDF2LaTeX-dataset
Users who are interested in PDF2LaTeX-dataset are comparing it to the libraries listed below
- Scanning Single Shot Detector for Math in Document Images☆133Updated 2 years ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆159Updated last year
- Another LaTeX equation OCR tool based on ConvNeXt and Transformer☆51Updated 2 years ago
- Solution to OpenAI's im2latex request for research☆88Updated last year
- PyTorch implementation of Deep CNN Encoder + LSTM Decoder with Attention for Image to LaTeX☆202Updated 2 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection☆69Updated 5 years ago
- Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files.☆19Updated 2 months ago
- Data repository for LaTeX OCR☆135Updated last year
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding. Published as "Tree-Based Represent…☆41Updated 2 years ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆37Updated 2 years ago
- Official implementation for ICDAR 2021 best poster paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Tr…☆128Updated last year
- Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"☆169Updated 8 months ago
- Python tools for creating suitable dataset for OpenAI's im2latex task: https://openai.com/requests-for-research/#im2latex☆143Updated 7 years ago
- Python and JS tools to generate printed LaTeX formulas and images☆16Updated 2 years ago
- Math-aware QA system☆18Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆115Updated last year
- A neural network capable of translating handwriting into text along with complex tools to generate datasets☆19Updated 5 years ago
- Generator of document visual question answering datasets in English and Chinese☆24Updated 2 years ago
- Large-scale training of a LaTeX formula recognition model; currently being organized for open-source release☆56Updated last year
- Converts from AsciiMath, LaTeX, MathML to LaTeX, MathML☆59Updated 6 years ago
- Math formula recognition (Images to LaTeX strings)☆307Updated 2 years ago
- K12 high school math exam question dataset☆15Updated 2 years ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Updated 8 months ago
- A command line interface to download PDF files from https://arxiv.org.☆62Updated 3 months ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Updated 5 years ago
- Python tool for generating LaTeX code from PDF files.☆100Updated 2 years ago
- 1st place solution for the ICDAR 2021 Competition on Mathematical Formula Detection☆132Updated 2 years ago
- arXiv Search UI & APIs☆129Updated last week