ck-unifr / pdf_parsingLinks

PDF解析（文字，章节，表格，图片，参考），基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答，摘要，信息抽取

☆205

Alternatives and similar repositories for pdf_parsing

Users that are interested in pdf_parsing are comparing it to the libraries listed below

Sorting:

shibing624 / chatgpt-webui
ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面
☆131Updated 11 months ago
Logistic98 / rag-omni
基于大语言模型的检索增强生成RAG示例
☆153Updated 2 months ago
wenge-research / YAYI-UIE
雅意信息抽取大模型：在百万级人工构造的高质量信息抽取数据上进行指令微调，由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)
☆306Updated 11 months ago
qianniuspace / llm_notebooks
AI 应用示例合集
☆102Updated last year
iMagist486 / ElasticSearch-Langchain-Chatglm2
Q&A based on elasticsearch+langchain+chatglm2 ｜基于elasticsearch，langchain，chatglm2的自有知识库问答
☆242Updated last year
taishan1994 / langchain-learning
langchain学习笔记，包含langchain源码解读、langchain中使用中文模型、langchain实例等。
☆216Updated 2 years ago
ZixinxinWang / Legal-Eagle-InternLM
Legal-Eagle-InternLM 是一个基于商汤科技和上海人工智能实验室推出的书生浦语大模型InternLM的法律问答机器人。旨在为用户提供符合3H（即Helpful、Honest、Harmless）原则的专业、智能、全面的法律服务的法律领域大模型。
☆58Updated last year
360AILAB-NLP / 360LayoutAnalysis
360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute
☆292Updated 10 months ago
duanyu / LabelFast
中文世界的NLP自动标注开源工具，简单样本，交给LabelFast。
☆74Updated 6 months ago
jayli / langchain-GLM_Agent
本地知识库 + chatGLM6B + CustomAgent
☆270Updated 2 years ago
qq31682216 / chatgpt_all
学习开源chatGPT类模型的指南，汇总各种训练数据获取、模型微调、模型服务的方法，以及记录自己操作总遇到的各种常见坑，欢迎收藏、转发，希望能帮你省一些时间
☆76Updated last year
RapidAI / RapidRAG
QA based on local knowledge and LLM.
☆231Updated 6 months ago
billvsme / law_ai
💼法律AI助手，法律RAG，通过全部200+本法律手册📖、网页搜索内容💻结合LLM回答你的问题，并且给出相应的法规和网站，基于⚡️ langchain，Gradio，openai，chroma，duckduckgo-search
☆160Updated last year
Airmomo / graphrag-practice-chinese
GraphRAG的应用实例，项目特点在于提供了替换OpenAI模型的方法，并通过修改原有提示和切分文档的方法，提高了GraphRAG处理中文内容的能力。
☆166Updated 9 months ago
neukg / TechGPT
TechGPT: Technology-Oriented Generative Pretrained Transformer
☆225Updated 2 years ago
coggle-club / notebooks
数据科学教程、大模型实践案例
☆143Updated last month
threeColorFr / LLMforDialogDataGenerate
Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集
☆158Updated last year
linancn / TianGong-AI-Unstructure
TianGong-AI-Unstructure
☆68Updated last month
shibing624 / ChatPDF
RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能，基于本地LLM、embedding模型、reranker模型实现，支持GraphRAG，无须安装任何第三方agent库。
☆788Updated 3 months ago
wangxb96 / RAG-QA-Generator
RAG-QA-Generator 是一个用于检索增强生成（RAG）系统的自动化知识库构建与管理工具。该工具通过读取文档数据，利用大规模语言模型生成高质量的问答对（QA对），并将这些数据插入数据库中，实现RAG系统知识库的自动化构建和管理。
☆219Updated 7 months ago
JovenChu / embedding_model_test
基于开源embedding模型的中文向量效果测试
☆143Updated 2 years ago
ZhouhaoJiang / PdfReader-LangChian-LLM
ChatPDF Implement PDF parsing based on LangChain and LLM language model(ChatGLM,GPT...) | ChatPDF 基于LangChain和LLM语言模型实现PDF解析阅读
☆53Updated last year
MetaGLM / LawGLM
探索 LLM 在法律行业的应用潜力
☆91Updated 7 months ago
Sshuoshuo / easy-rag
快速入门RAG与私有化部署
☆197Updated last year
liuhuanyong / RAGOnMedicalKG
RAGOnMedicalKG，将大模型RAG与KG结合，完成demo级问答，旨在给出基础的思路。
☆307Updated last year
yuntianhe2014 / Easy-RAG
一个适合学习、使用、自主扩展的RAG【检索增强生成】系统！可联网做AI搜索
☆502Updated 10 months ago
open-chinese / alpaca-chinese-dataset
Alpaca Chinese Dataset -- 中文指令微调数据集
☆209Updated 9 months ago
percent4 / embedding_rerank_retrieval
本项目是针对RAG中的Retrieve阶段的召回技术及算法效果所做评估实验。使用主体框架为LlamaIndex.
☆270Updated last week
intsig-textin / markdown_tester
如需体验textin文档解析，请点击https://cc.co/16YSIy
☆113Updated last month
RapidAI / RapidLayout
Analysis of Chinese and English layouts 中英文版面分析
☆230Updated 2 weeks ago