RQLuo / MixTeX-DataHub
LaTeXDataHub is an open-source platform dedicated to sharing and contributing real-world LaTeX image datasets and their annotations. It allows users to upload, download, and contribute to a growing collection of high-quality LaTeX datasets.
☆11Updated 8 months ago
Alternatives and similar repositories for MixTeX-DataHub:
Users who are interested in MixTeX-DataHub are comparing it to the libraries listed below:
- A GUI implementation of MixTex in Rust☆29Updated 2 months ago
- ☆56Updated last year
- ☆16Updated last month
- Large-scale training of a LaTeX formula recognition model; currently being organized and open-sourced☆52Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆50Updated this week
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆343Updated 5 months ago
- RWKV-RAG Personal Edition☆18Updated last month
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆146Updated 7 months ago
- Training a small, elegant formula recognition model based on TrOCR and the UniMER-1M dataset☆23Updated 5 months ago
- ☆26Updated last year
- Parse LaTeX math expressions☆21Updated 3 weeks ago
- Exploring deployment acceleration for the GOT-OCR project, not limited to any language☆60Updated 6 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆310Updated last month
- ☆14Updated 7 months ago
- An API project for SenseVoice that outputs timestamped subtitles☆34Updated 5 months ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Updated last year
- Exploration of World Languages☆20Updated last year
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents☆74Updated last month
- Another LaTeX formula OCR tool☆15Updated 2 years ago
- This repo is used to release the ArxivFormula dataset.☆26Updated 5 months ago
- PDF data from Chinese-language papers, securities documents, and financial reports☆27Updated 10 months ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- ☆29Updated 8 months ago
- Convert LaTeX-OCR To ONNX☆10Updated last year
- ☆28Updated 11 months ago
- A fast RWKV Tokenizer written in Rust☆44Updated 3 weeks ago
- ☆24Updated last year
- Python and JS tools to generate printed LaTeX formulas and images☆16Updated last year
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆119Updated 5 months ago
- ☆14Updated 3 months ago