hbh112233abc / pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
☆57Updated last year
Alternatives and similar repositories for pdfplumber
Users that are interested in pdfplumber are comparing it to the libraries listed below
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute☆283Updated 8 months ago
- HanFei-1.0 (韩非), the first legal large language model in China trained with full-parameter training☆116Updated last year
- ☆41Updated last year
- clueai toolkit: build the custom API you need in 3 lines of code and 3 minutes!☆233Updated 2 years ago
- Convert PDF to TXT☆67Updated last year
- Basic framework for RAG (retrieval-augmented generation)☆84Updated last year
- A Multi-Modal Dataset of Chinese Governmental Documents☆34Updated 4 years ago
- Python ROUGE Score Implementation for Chinese Language Task (official rouge score)☆101Updated 11 months ago
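- Document orientation classification☆219Updated 6 months ago
- A native Chinese benchmark for evaluating retrieval-augmented generation☆118Updated last year
- kbqa, langchain, large language model, chatgpt☆80Updated 7 months ago
- Walkthrough of a 60-point baseline approach for the SMP 2023 ChatGLM Financial Large Model Challenge☆185Updated last year
- A document search tool built on sentence transformers and chatglm☆154Updated 2 years ago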
- pke_zh, Python keyphrase extraction for Chinese (zh). A tool for extracting Chinese keywords or key sentences; implements KeyBert, PositionRank, TopicRank, TextRank, and other algorithms, and works out of the box.☆208Updated last year
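- This project uses layoutlmv3 for training and inference on Chinese images. It mainly addresses three problems: 1. standardizing data into a usable training dataset format; 2. modifying tokenization for layoutlmv3-base-chinese; 3. splitting and sliding-window handling of text longer than 512.☆48Updated 9 months ago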
- Correction of Chinese spelling and grammatical errors☆153Updated last month
- ☆26Updated 3 years ago
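- 骆驼QA (Luotuo QA), a Chinese large language model for reading comprehension.☆74Updated 2 years ago
- Exploring the potential applications of LLMs in the legal industry☆89Updated 5 months ago
- Fine-tuning Chinese large language models with qlora, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE☆86Updated last year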
- TianGong-AI-Unstructure☆65Updated last month
- "桃李" (Taoli): a large language model for international Chinese education☆181Updated last year
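- ChatGPT WebUI using gradio. Provides a simple, easy-to-use Web UI for LLM chat and retrieval-based knowledge Q&A (RAG)☆130Updated 9 months ago
- Company name parser that extracts the brand from a company name. A segmentation tool for Chinese company names; supports extracting the place name, brand name (head word), industry term, and company suffix from a company name.☆90Updated 2 years ago
- [Gap-Tree-Sort algorithm] Performs layout analysis on OCR results or text extracted from PDFs and sorts it in human reading order.☆136Updated last year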
- 🌳CED: Catalog Extraction from Documents☆16Updated last year
- Implements Baichuan-Chat fine-tuning with various methods such as Lora and QLora; runs with one click.☆70Updated last year
- ☆66Updated 8 months ago
- 律知, a large language model for legal consultation☆38Updated last year
- Making NLP that everyone can do. Open source isn't easy, so remember to star!☆101Updated 2 years ago