hbh112233abc / pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
☆51Updated 9 months ago
Alternatives and similar repositories for pdfplumber:
Users that are interested in pdfplumber are comparing it to the libraries listed below
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆246Updated 2 months ago
- A Python Package to Access World-Class Generative Models☆125Updated 5 months ago
- clueai工具包: 3行代码3分钟,自定义需要的API!☆231Updated last year
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆93Updated 3 months ago
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆103Updated last year
- Based on RapidOCR, extract the PDF content.☆133Updated 3 months ago
- kbqa,langchain,large langauge model, chatgpt☆78Updated last month
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆85Updated last year
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆113Updated 9 months ago
- "桃李“: 国际中文教育大模型☆168Updated last year
- SMP 2023 ChatGLM金融大模型挑战赛 60 分baseline思路介绍☆182Updated last year
- 基于sentence transformers和chatglm实现的文档搜索工具☆154Updated last year
- 打造人人都会的NLP,开源不易,记得star哦☆101Updated last year
- ☆61Updated 2 months ago
- Legal-Eagle-InternLM 是一个基于商汤科技和上海人工智能实验室推出的书生浦语大模型InternLM的法律问答机器人。旨在为用户提供符合3H(即Helpful、Honest、Harmless)原则的专业、智能、全面的法律服务的法律领域大模型。☆48Updated 9 months ago
- change pdf to txt☆65Updated last year
- ☆37Updated 7 months ago
- 文档方向分类☆204Updated last week
- PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取☆163Updated last year
- ☆38Updated last year
- 利用LLM+敏感词库,来自动判别是否 涉及敏感词。☆106Updated last year
- TechGPT: Technology-Oriented Generative Pretrained Transformer☆216Updated last year
- 骆驼QA,中文大语言阅读理解模型。☆72Updated last year
- llama信息抽取实战☆97Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 5 months ago
- pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提 取工具,实现了KeyBert、PositionRank、TopicRank、TextRank等算法,开箱即用。☆190Updated 8 months ago
- 专注于中文领域大语言模型,落地到某个行业某个领域,成为一个行业大模型、公司级别或行业级别领域大模型。☆112Updated 2 months ago
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆45Updated last year
- Python ROUGE Score Implementation for Chinese Language Task (official rouge score)☆83Updated 5 months ago
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆233Updated 5 months ago