opendatalab/UniMERNet

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition

☆492

Alternatives and similar repositories for UniMERNet

Users who are interested in UniMERNet are comparing it to the repositories listed below.

InternScience / StructEqTable-Deploy
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆276 · Dec 6, 2025 · Updated 7 months ago
opendatalab / Miner-PDF-Benchmark
MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
☆24 · Dec 11, 2024 · Updated last year
opendatalab / DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,233 · Apr 14, 2025 · Updated last year
opendatalab / PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,797 · Jan 3, 2025 · Updated last year
opendatalab / CLIP-Parrot-Bias
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆66 · Sep 6, 2024 · Updated last year
FreeOCR-AI / layoutreader
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆322 · Aug 15, 2025 · Updated 11 months ago
OleehyO / TexTeller
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,…
☆752 · Aug 22, 2025 · Updated 11 months ago
opendatalab / OmniDocBench
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆1,900 · Jun 26, 2026 · Updated 3 weeks ago
opendatalab / OHR-Bench
(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆104 · Dec 3, 2025 · Updated 7 months ago
RapidAI / RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
☆388 · Nov 3, 2024 · Updated last year
Alpha-Innovator / DocParser
☆18 · Jan 13, 2025 · Updated last year
mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78 · Apr 9, 2024 · Updated 2 years ago
webvokess / Google-Stock-Price-Prediction-using-LSTM
Google Stock Price Prediction using Long Short-Term Memory (LSTM) is a deep learning-based approach to forecasting stock prices using his…
☆18 · Sep 6, 2025 · Updated 10 months ago
opendatalab / mineru-vl-utils
A Python package for interacting with the MinerU Vision-Language Model.
☆136 · Jun 11, 2026 · Updated last month
breezedeus / CnMFD_Dataset
Chinese Mathematical Formula Detection (MFD) Dataset for Chinese-language documents
☆35 · Dec 21, 2022 · Updated 3 years ago
felix-schmitt / MathNet
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10 · Mar 19, 2025 · Updated last year
VikParuchuri / texify
Math OCR model that outputs LaTeX and markdown
☆1,126 · Jan 29, 2025 · Updated last year
HCIILAB / M6Doc
☆163 · May 8, 2025 · Updated last year
Alpha-Innovator / DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
☆156 · Jan 13, 2025 · Updated last year
opendatalab / dsdl-docs
Data Set Description Language Specification (DSDL, a next-generation AI dataset description language)
☆46 · May 29, 2024 · Updated 2 years ago
breezedeus / Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…
☆3,196 · Feb 7, 2026 · Updated 5 months ago
RapidAI / RapidTable
An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope.
☆433 · Apr 23, 2026 · Updated 2 months ago
buptlihang / CDLA
CDLA: A Chinese document layout analysis (CDLA) dataset
☆293 · Sep 13, 2021 · Updated 4 years ago
TenMilesLotus / DTSM
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13 · Apr 28, 2024 · Updated 2 years ago
opendatalab / VIGC
AAAI 2024: Visual Instruction Generation and Correction
☆97 · Feb 4, 2024 · Updated 2 years ago
opendatalab / labelbee
☆25 · Nov 7, 2022 · Updated 3 years ago
InternScience / SimChart9K
The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.
☆26 · Feb 22, 2024 · Updated 2 years ago
microsoft / ArxivFormula
This repo is used to release the ArxivFormula dataset.
☆35 · Nov 12, 2024 · Updated last year
ZichenWen1 / DIJA
(ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"
☆79 · Feb 9, 2026 · Updated 5 months ago
AlibabaResearch / AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833 · Mar 17, 2026 · Updated 4 months ago
opendatalab / opendatalab-python-sdk
SDK of OpenDataLab - https://opendatalab.org.cn
☆60 · Jul 31, 2025 · Updated 11 months ago
SWHL / TrOCR-Formula-Rec
Training a small yet elegant formula recognition model based on TrOCR and the UniMER-1M dataset
☆30 · Mar 17, 2026 · Updated 4 months ago
MaxKinny / TabRecSet
A large scale camera-taken table detection and recognition dataset.
☆150 · Apr 9, 2026 · Updated 3 months ago
Mountchicken / Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206 · Nov 1, 2023 · Updated 2 years ago
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,155 · Feb 10, 2025 · Updated last year
opendatalab / magic-doc
☆549 · Jul 26, 2024 · Updated last year
opendatalab / CHARM
[ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs
☆46 · Jul 27, 2024 · Updated last year
LayTextLLM / LayTextLLM
☆103 · Dec 23, 2024 · Updated last year
LinXueyuanStdio / Data-for-LaTeX_OCR
Data repository for LaTeX OCR
☆142 · Jun 11, 2024 · Updated 2 years ago