DocTron-hub / DocTron-FormulaLinks
☆59Updated last month
Alternatives and similar repositories for DocTron-Formula
Users that are interested in DocTron-Formula are comparing it to the libraries listed below
Sorting:
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆184Updated 2 months ago
- An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star☆190Updated 3 years ago
- ☆135Updated 7 months ago
- Unveiling Super Experts in Mixture-of-Experts Large Language Models☆27Updated this week
- Deepseek-r1复现科普与资源汇总☆22Updated 6 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆93Updated 5 months ago
- PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References☆18Updated last year
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆14Updated last year
- Efficient Mixture of Experts for LLM Paper List☆129Updated this week
- A minimal, easy-to-read PyTorch reimplementation of the Qwen3 and Qwen2.5 VL with a fancy CLI☆146Updated 3 weeks ago
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆34Updated 2 months ago
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆255Updated this week
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆64Updated 7 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆57Updated 4 months ago
- mllm-npu: training multimodal large language models on Ascend NPUs☆92Updated last year
- [ICCV2025] A Token-level Text Image Foundation Model for Document Understanding☆120Updated last month
- Token level visualization tools for large language models☆88Updated 8 months ago
- Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is…☆42Updated 2 years ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆72Updated last year
- qwen-nsa☆74Updated 5 months ago
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems☆214Updated this week
- 青稞Talk☆146Updated this week
- differentiable top-k operator☆22Updated 8 months ago
- 本项目提供了基于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程,同时也可以作为一份极简的TP学习代码。☆29Updated last year
- MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding☆72Updated 7 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated last year
- Arxiv个性化定制化模版,实现对特定领域的相关内容、作者与学术会议的有效跟进。☆321Updated this week
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆312Updated 3 months ago
- tinybig for deep function learning☆61Updated 3 months ago
- Max的有趣数据集 / Max's awesome datasets☆43Updated 3 weeks ago