WalkerMitty / PDFparserLinks
Here is a demo for PDF parser (Including OCR, object detection tools)
☆35Updated 9 months ago
Alternatives and similar repositories for PDFparser
Users that are interested in PDFparser are comparing it to the libraries listed below
Sorting:
- TianGong-AI-Unstructure☆68Updated last month
- ☆28Updated 9 months ago
- 中文原生检索增强生成测评基准☆120Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆47Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆69Updated last year
- 中文论文、证券类、财报类PDF数据☆32Updated last year
- ☆57Updated last year
- ☆37Updated 3 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 11 months ago
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆39Updated 7 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- ☆15Updated last year
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆24Updated last year
- A Toolkit for Table-based Question Answering☆112Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆46Updated 4 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆114Updated last month
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆28Updated 3 weeks ago
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆160Updated 2 weeks ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆47Updated 7 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆64Updated last year
- 中文世界的NLP自动标注开源工具,简单样本,交给LabelFast。☆74Updated 6 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆227Updated 3 months ago
- GOT的vLLM加速实现 并结合 MinerU 实现RAG中的pdf 解析☆61Updated 9 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆153Updated last year
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆197Updated 2 weeks ago
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated 2 years ago
- Search, organize, discover anything!☆48Updated last year
- ☆105Updated last year
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆158Updated last year