WalkerMitty / PDFparserLinks
Here is a demo for PDF parser (Including OCR, object detection tools)
☆36Updated last year
Alternatives and similar repositories for PDFparser
Users that are interested in PDFparser are comparing it to the libraries listed below
Sorting:
- ☆27Updated last year
- TianGong-AI-Unstructure☆69Updated last month
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆47Updated last year
- 中文原生检索增强生成测评基准☆123Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆73Updated last year
- ☆40Updated 7 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆42Updated 10 months ago
- ☆57Updated last year
- 中文论文、证券类、财报类PDF数据☆35Updated last year
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Updated 10 months ago
- ☆32Updated last month
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆48Updated 7 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆123Updated 4 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆161Updated 3 months ago
- ☆15Updated last year
- 大语言模型训练和服务调研☆36Updated 2 years ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆37Updated 10 months ago
- 基于baichuan-7b的开源多模态大语言模型☆72Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆305Updated last year
- PDF解析工具:GOT的vLLM加速实现,MinerU做布局识别裁剪、GOT做表格公式解析,实现RAG中的pdf解析☆66Updated last year
- share data, prompt data , pretraining data☆36Updated last year
- 大语言模型ChatGLM-6B为基座,接入文档阅读功能进行实时问答,可上传txt/docx/pdf多种文件类型。☆42Updated 2 years ago
- open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains☆116Updated 10 months ago
- Accelerating GOT-OCRv2 with VLLM☆11Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆33Updated 4 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Updated last year