BFlameSwift/Uni-MuMER

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BFlameSwift/Uni-MuMER)

BFlameSwift / Uni-MuMER

[NeurIPS'25 Spotlight🔥]Official implementation of Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition

☆37

Alternatives and similar repositories for Uni-MuMER

Users that are interested in Uni-MuMER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Howrunz / SSAN
View on GitHub
Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition
☆16Jan 21, 2025Updated last year
AiArt-Gao / HMEG
View on GitHub
[CVPR'24] Handwritten Mathematical Expressions Generation (HMEG)
☆34Jun 3, 2024Updated 2 years ago
qingzhenduyu / TAMER
View on GitHub
Official implementation for AAAI 2025 paper: TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
☆37Jul 28, 2025Updated 11 months ago
iFLYTEK-CV / EDU-CHEMC
View on GitHub
A handwritten Chemical Structure Image data set named EDU-CHEMC, which consists of totally 52,987 handwritten molecular structure images …
☆17May 12, 2025Updated last year
qingzhenduyu / ICAL
View on GitHub
Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…
☆29Aug 16, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
liuzhuang1024 / SAM
View on GitHub
Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023)
☆14Aug 29, 2023Updated 2 years ago
thanhnghiadk / syntactic_HME_generation
View on GitHub
This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.
☆14Feb 24, 2022Updated 4 years ago
yhzhu99 / FengruCupTemplate
View on GitHub
北航“冯如杯”论文模板 (2022年)
☆12Apr 24, 2022Updated 4 years ago
SJTU-DeepVisionLab / PosFormer
View on GitHub
[ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer
☆85Apr 10, 2025Updated last year
EDM-Research / VATr-pp
View on GitHub
☆18Jul 9, 2024Updated 2 years ago
Intellindust-AI-Lab / HTR-VT
View on GitHub
(Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”
☆134Jan 22, 2026Updated 6 months ago
tal-tech / SAN
View on GitHub
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
☆103Feb 21, 2023Updated 3 years ago
zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
VCG-team / elbo-t2ialign
View on GitHub
(TIP 2026, CCF-A) ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models
☆18Jul 6, 2026Updated 2 weeks ago
shannanyinxiang / ViTEraser
View on GitHub
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…
☆66Jul 4, 2024Updated 2 years ago
awei669 / DiffInk
View on GitHub
[ICLR 2026] DiffInk: Glyph- and Style-Aware Latent Diffusion Transformer for Text to Online Handwriting Generation
☆39Jun 19, 2026Updated last month
jack139 / ocr-rare-chars
View on GitHub
生僻字OCR识别优化训练
☆16Feb 16, 2023Updated 3 years ago
ihdia / seamformer
View on GitHub
Official repository accompaying the ICDAR 2023 paper
☆14Oct 3, 2023Updated 2 years ago
opendatalab / UniMERNet
View on GitHub
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆492Sep 28, 2025Updated 9 months ago
Lllllolita / CAN_Paddle
View on GitHub
☆16Mar 5, 2023Updated 3 years ago
zjykzj / MPDataset
View on GitHub
Custom Iterable Dataset Class for Large-Scale Data Loading
☆14Dec 8, 2021Updated 4 years ago
dasayan05 / chirodiff
View on GitHub
ChiroDiff: Modelling chirographic data with Diffusion Models
☆20Apr 11, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JINO-ROHIT / nano-paged-attention
View on GitHub
a minimal paged attention implementation
☆20Jan 30, 2026Updated 5 months ago
Planet-AI-GmbH / tfaip-hybrid-ctc-s2s
View on GitHub
Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"
☆17Oct 13, 2021Updated 4 years ago
BUAA-SE-2021 / software-engineering
View on GitHub
☆11Apr 23, 2021Updated 5 years ago
Levi-ZJY / SAN
View on GitHub
SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition
☆10Apr 8, 2024Updated 2 years ago
LARS-research / TREFE
View on GitHub
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Nov 25, 2022Updated 3 years ago
shannanyinxiang / UPOCR
View on GitHub
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆69Jun 6, 2024Updated 2 years ago
CXH-Research / StainRestorer
View on GitHub
[WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer
☆23Jan 14, 2026Updated 6 months ago
xxyQwQ / metascript
View on GitHub
Project of AI3604 Computer Vision, 2023 Fall, SJTU
☆27Aug 26, 2025Updated 11 months ago
shi-yx / URaG
View on GitHub
Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…
☆43Feb 4, 2026Updated 5 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
banban3forever / Demo-JEPA
View on GitHub
☆17May 21, 2026Updated 2 months ago
XH-B / ABM
View on GitHub
☆105Aug 22, 2024Updated last year
koninik / WordStylist
View on GitHub
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
☆83Jun 25, 2024Updated 2 years ago
JackHck / FVP
View on GitHub
[ICCV 2025] FVP: 4D Visual Pre-training for Robot Learning
☆17Sep 5, 2025Updated 10 months ago
aimagelab / Emuru-autoregressive-text-img
View on GitHub
Official PyTorch implementation for "Zero-Shot Styled Text Image Generation, but Make It Autoregressive" (CVPR25)
☆29Jul 31, 2025Updated 11 months ago
yqingli123 / TDv2
View on GitHub
The source codes of TDv2 in paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.
☆12Jul 28, 2022Updated 3 years ago
abcpp12383 / ThreeStageBinarization
View on GitHub
Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks
☆12Aug 12, 2025Updated 11 months ago