ck-unifr / pdf_parsingView external linksLinks
PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取
☆212Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for pdf_parsing
Users that are interested in pdf_parsing are comparing it to the libraries listed below
Sorting:
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆638Apr 22, 2024Updated last year
- Creating a graph that summarizes correlations between stocks and using a Graph Neural Network to encode that information to be utilized i…☆17May 19, 2023Updated 2 years ago
- 《大语言模型》综述全书学习笔记☆13Aug 2, 2024Updated last year
- 利用BERT预训练模型进行文本生成,可用于对话、摘要、问题生成等任务。 目前支持策略,词表的插入和删除、自定义Character Embedding、随机词替换等☆10Jun 1, 2022Updated 3 years ago
- Multi-Label Text Classification Based On Bert☆23Feb 28, 2023Updated 2 years ago
- 在index-tts-vllm的基础上,实现了并提供了模拟流式合成音频的接口服务及客户端测试脚本☆26Sep 2, 2025Updated 5 months ago
- A simple implement for multi-label text classification with Bert. I will extend the code to a higher version for very long text over 512,…☆12Jun 2, 2021Updated 4 years ago
- Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings f…☆13Aug 26, 2024Updated last year
- 文档方向分类☆221Feb 3, 2026Updated 2 weeks ago
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆28Feb 26, 2024Updated last year
- Github repo for Peifeng's internship project☆13Nov 7, 2023Updated 2 years ago
- 基于自由度(熵)、凝固度 新词发现算法实现☆12Oct 7, 2018Updated 7 years ago
- 使用opencv部署yolo11表格检测,它是百度网盘AI大赛-表格检测的第2名方案,方案里包含表格框检测,表格角点检测,表格方向分类,一共三个模块。我依然是编写了C++和Python两个版本的程序☆13Dec 12, 2024Updated last year
- 智谱AI 2024年金融行业大模型挑战赛仓库☆57Feb 19, 2025Updated 11 months ago
- 大模型智能体Agent中文教程,博客代码仓库☆58Nov 5, 2025Updated 3 months ago
- A Python Package to Access World-Class Generative Models☆131Jun 13, 2024Updated last year
- 🔥 专注于中文的「自然语言处理框架」:中文分词;平衡类别;数据集划分...☆12Nov 14, 2020Updated 5 years ago
- Based on RapidOCR, extract the PDF content☆185May 7, 2025Updated 9 months ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- ☆16Apr 7, 2024Updated last year
- llama信息抽取实战☆102Apr 29, 2023Updated 2 years ago
- 一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索☆523Sep 4, 2024Updated last year
- The KedoAI Process Intelligent Agent Development Platform is a cutting-edge platform designed to simplify the development of intelligent …☆20May 26, 2025Updated 8 months ago
- 本项目主要用于掌纹特征提取,主要工作包含: 1. 手掌掌纹ROI提取 2. 特征提取网络设置 3. 特征网络训练预测 其中,掌纹提取部分,主要实现参照`palm_rpi_ext` 实现,核心调用出口位置为instance.py 训练与推理为 train_palm_ext…☆11Sep 18, 2024Updated last year
- 基于大语言模型的检索增强生成RAG示例☆168May 4, 2025Updated 9 months ago
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 4 months ago
- PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References☆20Jun 13, 2024Updated last year
- LangChain实现的基于PDF文档构建问答知识库☆39Apr 12, 2024Updated last year
- FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。☆2,160May 8, 2024Updated last year
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Jun 18, 2023Updated 2 years ago
- 个人收藏资源导航☆30Oct 1, 2025Updated 4 months ago
- ☆16Sep 9, 2023Updated 2 years ago
- RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能,基于本地LLM、embedding模型、reranker模型实现,支持GraphRAG,无须安装任何第三方agent库。☆836Apr 2, 2025Updated 10 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆306Sep 10, 2024Updated last year
- 信息抽取相关论文。☆78Apr 13, 2023Updated 2 years ago
- EMNLP 2025 | TongSearch-QR☆41Dec 4, 2025Updated 2 months ago
- [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction☆4,322Jul 19, 2025Updated 6 months ago
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul ope…☆828May 28, 2024Updated last year