ccprocessor / llm-webkit-mirror
☆22Updated last week
Alternatives and similar repositories for llm-webkit-mirror
Users who are interested in llm-webkit-mirror are comparing it to the libraries listed below.
- A Python package for interacting with the MinerU Vision-Language Model.☆61Updated this week
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆218Updated 4 months ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆23Updated 10 months ago
- ☆179Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆247Updated 6 months ago
- datasets resource☆123Updated 3 months ago
- a-m-team's exploration in large language modeling☆189Updated 4 months ago
- Dingo: A Comprehensive AI Data Quality Evaluation Tool☆505Updated this week
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆354Updated last year
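- PDF parsing tool: a vLLM-accelerated implementation of GOT, with MinerU handling layout detection and cropping and GOT handling table and formula parsing, for PDF parsing in RAG☆64Updated 11 months ago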
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆384Updated 6 months ago
- WanJuan 1.0 (万卷1.0) multimodal corpus☆567Updated 2 years ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 3 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆171Updated last year
- ☆74Updated 9 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆398Updated 2 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆293Updated last year
- Collect every awesome work about r1!☆420Updated 5 months ago
- ☆234Updated last year
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute☆302Updated last year
- ☆312Updated last year
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆269Updated 9 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆278Updated 2 years ago
- Document Artificial Intelligence☆189Updated 3 weeks ago
- ☆54Updated last year
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆369Updated this week
- a toolkit on knowledge distillation for large language models☆181Updated last week
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆248Updated last month
- ☆49Updated last year
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆377Updated last week