WalkerMitty / PDFparser
A demo PDF parser (including OCR and object detection tools)
☆34Updated 4 months ago
Alternatives and similar repositories for PDFparser:
Users that are interested in PDFparser are comparing it to the libraries listed below
- ☆25Updated 4 months ago
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser☆46Updated 9 months ago
- TianGong-AI-Unstructure☆62Updated last month
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆62Updated 7 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆76Updated 4 months ago
- The newest version of llama3, with source code explained line by line in Chinese☆22Updated 10 months ago
- Chinese-native retrieval-augmented generation evaluation benchmark☆111Updated 10 months ago
- A survey of large language model training and serving☆37Updated last year
- A simple MLLM that surpasses QwenVL-Max using only open-source data, with a 14B LLM.☆37Updated 6 months ago
- LLM+RAG for QA☆22Updated last year
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆21Updated 3 months ago
- A Toolkit for Table-based Question Answering☆110Updated last year
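- AGM (阿格姆): an AI gene map model that explores the inner workings of AI models and GPT/LLM large models from the perspective of token-weight particles.☆28Updated last year
- The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and the ability to interpret the meaning of memes, approaching the recognition level of gpt4o and claude-3.5-sonnet!☆21Updated 7 months ago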
- ☆63Updated 5 months ago
- ☆56Updated last year
- A light proxy solution for HuggingFace hub.☆46Updated last year
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆23Updated 3 months ago
- Uses OCR recognition results to constrain the answers of multimodal large models in visual information extraction tasks☆27Updated 2 months ago
- zero: zero-training LLM parameter tuning☆31Updated last year
- ☆33Updated 2 months ago
- This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.☆19Updated last year
- ☆40Updated last year
- ☆16Updated 8 months ago
- Chinese layout detection: yolov8 is used to detect the layout of Chinese document images.☆58Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆45Updated 2 months ago
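- Uses langchain for task planning and builds conversational-scenario resources for each subtask. An MCTS task executor lets each subtask draw on the resources in its context and explore through self-reflection to reach its own best answer to the problem. This approach relies on the model's alignment preferences; for each preference, an engineering framework is designed to implement a sampling strategy over the self-assigned rewards of different answers.☆29Updated this week
- Chinese-native industrial evaluation benchmark☆13Updated 11 months ago
- An open-source multimodal large language model based on baichuan-7b☆73Updated last year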