This repo is used to release the ArxivFormula dataset.
☆35 · Nov 12, 2024 · Updated last year
Alternatives and similar repositories for ArxivFormula
Users who are interested in ArxivFormula compare it to the repositories listed below.
- Datasets and Evaluation Scripts for CompHRDoc ☆57 · Feb 25, 2025 · Updated last year
- FormulaNet is a new large-scale Mathematical Formula Detection dataset. ☆20 · Nov 21, 2022 · Updated 3 years ago
- Official repository accompanying the ICDAR 2023 paper ☆13 · Oct 3, 2023 · Updated 2 years ago
- A curated collection of projects, benchmarks, and research papers focused on reproducing and advancing the DeepSeek R1 framework. ☆15 · Mar 19, 2025 · Updated last year
- This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set. ☆14 · Feb 24, 2022 · Updated 4 years ago
- ☆23 · Dec 12, 2024 · Updated last year
- ☆37 · Jan 26, 2026 · Updated last month
- Syntax-Aware Network for Handwritten Mathematical Expression Recognition ☆100 · Feb 21, 2023 · Updated 3 years ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios. ☆24 · Dec 11, 2024 · Updated last year
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition ☆16 · Jan 21, 2025 · Updated last year
- UniTable: Towards a Unified Table Foundation Model ☆529 · Jun 4, 2024 · Updated last year
- Handwritten mathematical symbols recognition with TrOCR ☆21 · Jul 11, 2023 · Updated 2 years ago
- Python and JS tools to generate printed LaTeX formulas and images ☆16 · Oct 26, 2023 · Updated 2 years ago
- Continuous diffusion for layout generation ☆54 · Feb 19, 2025 · Updated last year
- ☆12 · Oct 10, 2024 · Updated last year
- Formula recognition based on LaTeX-OCR and ONNXRuntime. ☆382 · Nov 3, 2024 · Updated last year
- ☆104 · Aug 22, 2024 · Updated last year
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 ☆13 · Nov 25, 2022 · Updated 3 years ago
- ☆26 · Feb 3, 2023 · Updated 3 years ago
- This Grasshopper plugin is a set of 4 components to allow for the usage of Google's "Quick, Draw!" dataset inside of Grasshopper/Rhino. ☆12 · Sep 14, 2020 · Updated 5 years ago
- End to end system on recognition of Handwritten Math Symbols ☆12 · Aug 27, 2016 · Updated 9 years ago
- Convert LaTeX-OCR To ONNX ☆14 · Apr 2, 2024 · Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆459 · Sep 28, 2025 · Updated 5 months ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation ☆13 · May 13, 2023 · Updated 2 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024) ☆18 · Sep 29, 2024 · Updated last year
- Handwritten Math Expressions Recognition ☆13 · Sep 8, 2017 · Updated 8 years ago
- Vary-tiny codebase upon LAVIS (for training from scratch) and a PDF image-text pairs data (about 600k including English/Chinese) ☆86 · Sep 21, 2024 · Updated last year
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR ☆15 · Dec 4, 2021 · Updated 4 years ago
- Detecting 400+ landmarks on faces using Computer Vision. A part of a wider project. ☆12 · May 7, 2023 · Updated 2 years ago
- ☆41 · Jun 15, 2024 · Updated last year
- Scanipy stands for "scan it with Python"; it's your smart Python library for scanning and parsing complex PDF files like books, reports, a… ☆19 · Dec 30, 2023 · Updated 2 years ago
- ☆44 · Jul 9, 2024 · Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19 · Mar 7, 2023 · Updated 3 years ago
- This example app shows how to recognize mathematical expressions using the Selvy Pen SDK for Math on Android. ☆10 · Sep 18, 2023 · Updated 2 years ago
- CDLA: A Chinese document layout analysis (CDLA) dataset ☆289 · Sep 13, 2021 · Updated 4 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆83 · Jan 30, 2023 · Updated 3 years ago
- Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023) ☆15 · Aug 29, 2023 · Updated 2 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system. ☆11 · Dec 1, 2022 · Updated 3 years ago
- ☆17 · Jul 9, 2024 · Updated last year