RQLuo / MixTeX-DataHubLinks
LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotations, allows users to upload, download, and contribute to a growing collection of high-quality LaTeX datasets.
☆12Updated last year
Alternatives and similar repositories for MixTeX-DataHub
Users that are interested in MixTeX-DataHub are comparing it to the libraries listed below
Sorting:
- Convert LaTeX-OCR To ONNX☆13Updated last year
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Updated 10 months ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆159Updated last year
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆378Updated last year
- ☆57Updated 2 years ago
- Large scale training of Latex formula recognition model, currently being organized and open source☆56Updated last year
- A repo for the Formula Recognition Model (im2latex) based on Vision Encoder Decoder Model☆19Updated last year
- Exploration of World Languages☆19Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆25Updated last month
- Chinese tokens in tiktoken tokenizers.☆32Updated last year
- Another LaTex formula OCR tool☆15Updated 2 years ago
- ☆194Updated 2 months ago
- Using Llam.cpp and onnxruntime to accelerate inference of GOT-OCR2.0☆15Updated 11 months ago
- doc2x docs☆74Updated last year
- Parse LaTeX math expressions☆38Updated 3 weeks ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆56Updated this week
- Python and JS tools to generate Printed LaTex formulas and images☆16Updated 2 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆452Updated 4 months ago
- ☆19Updated last year
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,…☆705Updated 5 months ago
- A fast RWKV Tokenizer written in Rust☆54Updated 5 months ago
- A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It…☆43Updated last year
- 生成训练文本检测数据集☆12Updated 5 years ago
- DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction☆27Updated 2 years ago
- ☆29Updated last year
- This repo is used to release the ArxivFormula dataset.☆35Updated last year
- 中文论文、证券类、财报类PDF数据☆36Updated last year
- 一行代码搞定 Python 图表中文展示☆70Updated last week
- VimTS: A Unified Video and Image Text Spotter☆78Updated last year
- 用于学习GOT/Qwen/OnnxLLm☆53Updated last year