RQLuo / MixTeX-DataHubLinks
LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotations, allows users to upload, download, and contribute to a growing collection of high-quality LaTeX datasets.
☆11Updated last year
Alternatives and similar repositories for MixTeX-DataHub
Users that are interested in MixTeX-DataHub are comparing it to the libraries listed below
Sorting:
- ☆57Updated last year
- Convert LaTeX-OCR To ONNX☆11Updated last year
- Chinese tokens in tiktoken tokenizers.☆31Updated last year
- ☆45Updated last week
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆54Updated this week
- Exploration of World Languages☆19Updated last year
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆157Updated 11 months ago
- ☆14Updated last year
- BNU CERNET CLI 是一款专为北京师范大学校园网用户设计的命令行客户端。自2023年7月1日校园网服务升级后,原有的命令行客户端无法正常使用。为了解决这个问题,我们开发了这款新的客户端,使用户能够在命令行环境下便捷地登录校园网并访问互联网资源。☆13Updated 2 years ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆172Updated this week
- A fast RWKV Tokenizer written in Rust☆53Updated last month
- Creating Your Divine Agent 😇☆10Updated last month
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆27Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆21Updated 6 months ago
- A repo for the Formula Recognition Model (im2latex) based on Vision Encoder Decoder Model☆16Updated last year
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆365Updated 10 months ago
- ☆16Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆35Updated last year
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆141Updated 3 months ago
- ☆34Updated last year
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- This project is to extend RWKV LM's capabilities including sequence classification/embedding/peft/cross encoder/bi encoder/multi modaliti…☆10Updated last year
- VimTS: A Unified Video and Image Text Spotter☆79Updated 10 months ago
- SwanLab Local Visualization Python Package Plugin|SwanLab本地可视化python包插件☆20Updated 6 months ago
- ☆14Updated last year
- ☆22Updated 5 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆99Updated last week
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆105Updated last year
- ☆137Updated last month