breezedeus/CnMFD_Dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/breezedeus/CnMFD_Dataset)

breezedeus / CnMFD_Dataset

Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集

☆35

Alternatives and similar repositories for CnMFD_Dataset

Users that are interested in CnMFD_Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MaliParag / TFD-ICDAR2019
View on GitHub
TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection
☆69Feb 9, 2020Updated 6 years ago
Yuxiang1995 / ICDAR2021_MFD
View on GitHub
1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection（公式检测冠军方案）
☆134Sep 4, 2023Updated 2 years ago
omarWafaay / MathFormApp
View on GitHub
Application for Math formula detection in image/pdf and then recognition
☆13Jan 14, 2025Updated last year
felix-schmitt / MathNet
View on GitHub
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10Mar 19, 2025Updated last year
breezedeus / CnSTD
View on GitHub
CnSTD: 基于 PyTorch/MXNet 的中文/英文场景文字检测（Scene Text Detection）、数学公式检测（Mathematical Formula Detection, MFD）、篇章分析（Layout Analysis）的Python3 包
☆792Jul 5, 2026Updated 2 weeks ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
OleehyO / TexTeller
View on GitHub
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,…
☆752Aug 22, 2025Updated 11 months ago
farhan-mohammed / OMath
View on GitHub
OMath is a Mouse Drawn Math to LaTeX converter meant for professors to be used as a teaching tool in their virtual online lectures. Imple…
☆10Jan 9, 2024Updated 2 years ago
opendatalab / UniMERNet
View on GitHub
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆492Sep 28, 2025Updated 9 months ago
buptlihang / CDLA
View on GitHub
CDLA: A Chinese document layout analysis (CDLA) dataset
☆293Sep 13, 2021Updated 4 years ago
opendatalab / Miner-PDF-Benchmark
View on GitHub
MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
☆24Dec 11, 2024Updated last year
wzlxjtu / PDF2LaTeX-dataset
View on GitHub
☆22Jul 9, 2020Updated 6 years ago
duaibeom / chemOCR
View on GitHub
DB-based Optical Chemical Structure Recognition
☆13Sep 12, 2022Updated 3 years ago
Chingliu / xilou_core
View on GitHub
基于pdfium的pdf/ofd双引擎解析渲染引擎
☆13Oct 15, 2024Updated last year
lqtrung1998 / mwp_cot_design
View on GitHub
☆14Oct 11, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ibm-aur-nlp / PubTabNet
View on GitHub
☆483Jul 8, 2025Updated last year
LivingSkyTechnologies / Document_Layout_Segmentation
View on GitHub
Repository to use/train segmentation models for document layout analysis
☆19Jan 13, 2022Updated 4 years ago
ChenZhounan / PEN-Net
View on GitHub
This is a Repository corresponding to ACCV2022 accepted paper ”Complex Handwriting Trajectory Recovery: Evaluation Metrics and Algorithm“…
☆14Oct 3, 2022Updated 3 years ago
freedom10086 / PdfiumAndroid
View on GitHub
buid pdfium for android use cmake
☆11Jun 13, 2017Updated 9 years ago
Neon-Jing / Guider
View on GitHub
[WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…
☆14Oct 14, 2025Updated 9 months ago
HaikuArchives / PDFWriter
View on GitHub
A printer driver that writes PDF files instead of sending data to a printer.
☆11May 21, 2026Updated 2 months ago
360AILABNLP / 360LayoutAnalysis
View on GitHub
☆28Oct 14, 2024Updated last year
BobLd / simple-docstrum
View on GitHub
A step-by-step C# implementation of the Docstrum algorithm
☆24Dec 13, 2020Updated 5 years ago
namtuanly / WikiTableSet
View on GitHub
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
☆32Jun 12, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CrazySummerday / ctpn.pytorch
View on GitHub
Pytorch implementation of CTPN (Detecting Text in Natural Image with Connectionist Text Proposal Network)
☆47Sep 2, 2020Updated 5 years ago
gaborchris / DeepReplace
View on GitHub
This project focuses on using deep learning to replace text in images while retaining the same font and style.
☆10Dec 9, 2019Updated 6 years ago
jyanln / AlignReg
View on GitHub
☆17Apr 17, 2024Updated 2 years ago
ArminKmz / im2latex
View on GitHub
Pytorch implementation of math equation images to latex markup language.
☆30Oct 25, 2020Updated 5 years ago
wanggangkun / ST-Text-GCN
View on GitHub
Code for paper "Self-training Method Based on GCN for Semi-supervised Short Text Classification"
☆11Oct 30, 2021Updated 4 years ago
PaddleCV-SIG / PaddleLabel-Frontend
View on GitHub
飞桨智能标注 - 前端
☆20Mar 27, 2023Updated 3 years ago
jianlong-yuan / LovaszSoftmax_tf
View on GitHub
fix the speed of tensorflow
☆13Jun 15, 2020Updated 6 years ago
tobiasvanderwerff / full-page-handwriting-recognition
View on GitHub
Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).
☆54Oct 11, 2022Updated 3 years ago
zhyhan / RDA
View on GitHub
Robust Domain Adaptation under Noisy Environments
☆18Jul 22, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SWHL / TrOCR-Formula-Rec
View on GitHub
基于TrOCR + UniMER-1M数据集，训练一个小而美的公式识别模型
☆30Mar 17, 2026Updated 4 months ago
adlnlp / doc_gcn
View on GitHub
☆19May 30, 2023Updated 3 years ago
xiaolongonly / PDFiumForAndroidDemo
View on GitHub
A Pretty PDFViewer
☆13Sep 7, 2018Updated 7 years ago
KnIfER / PDFium-Android-Demo
View on GitHub
Using LibPDFium for android .
☆16Nov 18, 2020Updated 5 years ago
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆652Aug 12, 2024Updated last year
Jo-wang / Daily-Paper-Reading
View on GitHub
Daily paper reading records
☆15Mar 31, 2025Updated last year
AyanKumarBhunia / Handwriting-Trajectory-Recovery
View on GitHub
Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network, ICPR 2018.
☆15Jul 17, 2019Updated 7 years ago