hbh112233abc / pdfplumberLinks
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
☆57Updated last year
Alternatives and similar repositories for pdfplumber
Users that are interested in pdfplumber are comparing it to the libraries listed below
Sorting:
- clueai工具包: 3行代码3分钟,自定义需要的API!☆231Updated 2 years ago
- PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取☆208Updated last year
- 夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答…☆360Updated 2 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆302Updated last year
- "桃李“: 国际中文教育大模型☆183Updated last year
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆124Updated last year
- 基于sentence transformers和chatglm实现的文档搜索工具☆157Updated 2 years ago
- change pdf to txt☆67Updated 2 years ago
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆226Updated 2 years ago
- 利用LLM+敏感词库,来自动判别是否涉及敏感词。☆129Updated 2 years ago
- basic framework for rag(retrieval augment generation)☆85Updated last year
- SMP 2023 ChatGLM金融大模型挑战赛 60 分baseline思路介绍☆186Updated 2 years ago
- 大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning☆70Updated last year
- kbqa,langchain,large langauge model, chatgpt☆81Updated 11 months ago
- 语言模型中文认知能力分析☆236Updated 2 years ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆310Updated last year
- LingoWhale-8B: Open Bilingual LLMs | 开源双语预训练大模型☆142Updated last year
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆254Updated 2 years ago
- Alpaca Chinese Dataset -- 中文指令微调数据集☆214Updated last year
- 中文原生检索增强生成测评基准☆123Updated last year
- 中文书籍收录整理, Collection of Chinese Books☆198Updated last year
- Python ROUGE Score Implementation for Chinese Language Task (official rouge score)☆110Updated last year
- pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提取工具,实现了KeyBert、PositionRank、TopicRank、TextRank等算法,开箱即用。☆207Updated last year
- 文本去重☆76Updated last year
- chatglm多gpu用deepspeed和☆412Updated last year
- PromptCLUE, 全中文任务支持零样本学习模型☆664Updated 2 years ago
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆72Updated 2 years ago
- 基于开源embedding模型的中文向量效果测试☆144Updated 2 years ago
- 打造人人都会的NLP,开源不易,记得star哦☆101Updated 2 years ago
- unified embedding model☆871Updated 2 years ago