WalkerMitty / PDFparserLinks
Here is a demo for PDF parser (Including OCR, object detection tools)
☆36Updated last year
Alternatives and similar repositories for PDFparser
Users that are interested in PDFparser are comparing it to the libraries listed below
Sorting:
- ☆28Updated last year
- 中文原生检索增强生成测评基准☆124Updated last year
- TianGong-AI-Unstructure☆69Updated 3 months ago
- ☆40Updated 9 months ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆48Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆163Updated 2 years ago
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆49Updated 9 months ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Updated last year
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆50Updated last week
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆43Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆72Updated 2 years ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆124Updated 6 months ago
- ☆15Updated last year
- Accelerating GOT-OCRv2 with VLLM☆11Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆305Updated last year
- 大语言模型ChatGLM-6B为基座,接入文档阅读功能进行实时问答,可上传txt/docx/pdf多种文件类型。☆43Updated 2 years ago
- 中文论文、证券类、财报类PDF数据☆35Updated last year
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆162Updated 5 months ago
- PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.☆65Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆38Updated 6 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆69Updated last year
- ☆57Updated last year
- 大语言模型训练和服务调研☆37Updated 2 years ago
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆126Updated 2 years ago
- PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取☆215Updated 2 years ago
- Xtuner Factory☆35Updated last year