hbh112233abc / pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
☆56Updated last year
Alternatives and similar repositories for pdfplumber:
Users that are interested in pdfplumber are comparing it to the libraries listed below
- ☆26Updated 3 years ago
- clueai工具包: 3行代码3分钟,自定义需要的API!☆233Updated 2 years ago
- change pdf to txt☆67Updated last year
- A Python Package to Access World-Class Generative Models☆127Updated 10 months ago
- A Multi-Modal Dataset of Chinese Governmental Docunments☆32Updated 4 years ago
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆116Updated last year
- kbqa,langchain,large langauge model, chatgpt☆80Updated 6 months ago
- basic framework for rag(retrieval augment generation)☆83Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆279Updated 7 months ago
- 大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning☆57Updated 9 months ago
- 专注于中文领域大语言模型,落地到某个行业某个领域,成为一个行业大模型、公司级别或行业级别领域大模型。☆118Updated last month
- 骆驼QA,中文大语言阅读理解模型。☆74Updated last year
- 语言模型中文认知能力分析☆236Updated last year
- LingoWhale-8B: Open Bilingual LLMs | 开源双语预训练大模型☆138Updated last year
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆164Updated last year
- 中文世界的NLP自动标注开源工具,简单样本,交给LabelFast。☆70Updated 3 months ago
- SMP 2023 ChatGLM金融大模型挑战赛 60 分baseline思路介绍☆184Updated last year
- 在中文开源大模型的基础上进行定制化的微调,拥有自己专属的语言模型。☆47Updated last year
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆113Updated last year
- NL2SQL competition dataset☆201Updated last year
- pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提取工具,实现了KeyBert、PositionRank、TopicRank、TextRank等算法,开箱即用。☆204Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆79Updated 9 months ago
- deep training task☆29Updated 2 years ago
- 大语言模型指令调优工具(支持 FlashAttention)☆172Updated last year
- "桃李“: 国际中文教育大模型☆177Updated last year
- 利用LLM+敏感词库,来自动判别是否涉及敏感词。☆118Updated last year
- company name parser, extract company name brand. 中文公司名称分词工具,支持公司名称中的地名,品牌名(主词),行业词,公司名后缀提取。☆90Updated 2 years ago
- ☆64Updated 2 years ago
- 打造人人都会的NLP,开源不易,记得star哦☆101Updated 2 years ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆537Updated last year