WalkerMitty / PDFparserLinks
Here is a demo for PDF parser (Including OCR, object detection tools)
☆36Updated last year
Alternatives and similar repositories for PDFparser
Users that are interested in PDFparser are comparing it to the libraries listed below
Sorting:
- ☆27Updated last year
- 中文原生检索增强生成测评基准☆123Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆47Updated last year
- TianGong-AI-Unstructure☆69Updated 2 months ago
- ☆40Updated 8 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆73Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆43Updated 11 months ago
- 基于baichuan-7b的开源多模态大语言模型☆72Updated 2 years ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- PDF解析工具:GOT的vLLM加速实现,MinerU做布局识别裁剪、GOT做表格公式解析,实现RAG中的pdf解析☆65Updated last year
- 中文论文、证券类、财报类PDF数据☆35Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆34Updated 5 months ago
- ☆106Updated 2 years ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆162Updated 4 months ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆47Updated 11 months ago
- ☆31Updated 2 months ago
- ☆57Updated last year
- ☆15Updated last year
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆23Updated last year
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆63Updated 8 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆219Updated this week
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆69Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆48Updated 8 months ago
- open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains☆116Updated 11 months ago
- Xtuner Factory☆35Updated last year
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆163Updated 2 years ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated last week