ccprocessor / llm-webkit-mirror
☆18Updated last week
Alternatives and similar repositories for llm-webkit-mirror
Users that are interested in llm-webkit-mirror are comparing it to the libraries listed below
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆23Updated 6 months ago
- A high-efficiency open-source toolkit for the table-to-LaTeX task☆246Updated 6 months ago
- A vLLM-accelerated implementation of GOT, combined with MinerU for PDF parsing in RAG☆58Updated 7 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆225Updated 2 months ago
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute☆290Updated 9 months ago
- ☆169Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆151Updated last year
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆526Updated last month
- ☆130Updated last month
- datasets resource☆117Updated 2 months ago
- Dingo: A Comprehensive AI Data Quality Evaluation Tool☆255Updated this week
- SDK of OpenDataLab - https://opendatalab.org.cn☆57Updated last year
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆276Updated last year
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆348Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆204Updated 2 weeks ago
- [ArXiv] PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆120Updated 3 weeks ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆252Updated 2 weeks ago
- WanJuan 1.0 multimodal corpus☆561Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆354Updated last week
- AAAI 2024: Visual Instruction Generation and Correction☆93Updated last year
- ☆229Updated last year
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆341Updated 2 months ago
- ☆136Updated last year
- ☆66Updated 5 months ago
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆368Updated last month
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated 3 weeks ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆16Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆160Updated this week
- a-m-team's exploration in large language modeling☆161Updated 3 weeks ago
- ☆339Updated last year