winter1203 / vllm_GOT2_OCRLinks
Accelerating GOT-OCRv2 with VLLM
☆11Updated last year
Alternatives and similar repositories for vllm_GOT2_OCR
Users that are interested in vllm_GOT2_OCR are comparing it to the libraries listed below
Sorting:
- 从零构建了Agent中最重要的功能-function call☆17Updated last year
- Here is a demo for PDF parser (Including OCR, object detection tools)☆36Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆48Updated last year
- ☆28Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- ☆57Updated 2 years ago
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆49Updated 10 months ago
- ☆33Updated last month
- ☆29Updated 11 months ago
- PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.☆65Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 11 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- ☆101Updated last year
- ☆55Updated last year
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆43Updated last year
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆85Updated 6 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- ☆15Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- TianGong-AI-Unstructure☆69Updated 3 months ago
- [Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"☆17Updated 2 years ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- Support finetuning GLM4v with zero2☆16Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated last year
- Agentic Learning Powered by AWorld☆80Updated last week
- ☆28Updated last year
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆41Updated 4 months ago
- aigc evals☆10Updated 2 years ago
- 中文原生工业测评基准☆15Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆195Updated last year