hbh112233abc/pdfplumber

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hbh112233abc/pdfplumber)

hbh112233abc / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

☆62

Alternatives and similar repositories for pdfplumber

Users that are interested in pdfplumber are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

icip-cas / Chinese-PPDB
View on GitHub
Chineses-PPDB
☆14Nov 23, 2020Updated 5 years ago
Muennighoff / FLAN
View on GitHub
Provides a minimal implementation to extract FLAN datasets for further processing
☆11Feb 1, 2023Updated 3 years ago
linjieccc / PaddleNLP
View on GitHub
Easy-to-use and Fast NLP library with awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications.
☆12Mar 13, 2024Updated 2 years ago
Qyu-ai / Reina
View on GitHub
PySpark-based causal inference package.
☆13Aug 20, 2021Updated 4 years ago
tensorchord / ai-infra-statistics
View on GitHub
This repository contains statistics about the AI Infrastructure products.
☆16Feb 27, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pisa-engine / BMP
View on GitHub
Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.
☆37Jan 14, 2026Updated 6 months ago
alexrs / herd
View on GitHub
Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆11Feb 11, 2024Updated 2 years ago
jiaohuix / PaddleSeq
View on GitHub
PaddleSeq
☆10Mar 28, 2023Updated 3 years ago
ZJU-DAILY / Metric_Index
View on GitHub
This repository contains the code of metric indexing for exact similarity search.
☆12Jul 11, 2023Updated 3 years ago
kurbster / Prompt-Summarization
View on GitHub
Using NLP techniques to summarize prompts for program synthesis
☆17Sep 26, 2023Updated 2 years ago
emanuele / PyCoverTree
View on GitHub
Python library of cover tree (http://hunch.net/~jl/projects/cover_tree/cover_tree.html) for fast nearest neighbor querying
☆15Jan 3, 2012Updated 14 years ago
chenzen94 / debug-deepspeed-chat
View on GitHub
Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)
☆10Apr 17, 2023Updated 3 years ago
YanSte / NLP-LLM-Fine-tuning-QA-LoRA-T5
View on GitHub
Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and make Chatbot Question answering (QA) with LoRA…
☆12Jan 20, 2024Updated 2 years ago
sfeng-m / REAL4MWP
View on GitHub
Code for EMNLP 2021 Paper "Recall and Learn: A Memory-augmented Solver for Math Word Problems".
☆16Oct 20, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cehinson / ERRANT_ZH
View on GitHub
☆15Jan 21, 2021Updated 5 years ago
AI-confused / Sequence-to-Action
View on GitHub
Grammar correct project based Tencent's paper(Sequence to Action)
☆15Sep 8, 2022Updated 3 years ago
AI-confused / CGEC-with-Pointer-Generator-Network-Bart
View on GitHub
基于Bart语言模型的指针生成网络，用于中文语法纠错任务
☆16Sep 8, 2022Updated 3 years ago
ZJU-DAILY / FusionQuery
View on GitHub
[VLDB 2024] Source code for FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data
☆15Mar 11, 2025Updated last year
LINs-lab / M3
View on GitHub
[ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection
☆14Mar 7, 2024Updated 2 years ago
AI-confused / arxiv_auto_crawler
View on GitHub
auto scrawl for arrive data
☆16Jan 24, 2022Updated 4 years ago
nnzhaocs / DupHunter
View on GitHub
☆16May 4, 2021Updated 5 years ago
yytzsy / SMCG
View on GitHub
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12Apr 14, 2021Updated 5 years ago
seanzhang-zhichen / PytorchBilstmCRF-Information-Extraction
View on GitHub
基于Bilstm + CRF的信息抽取模型
☆37Aug 2, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Streamedian / html5_hls_player
View on GitHub
html5 HLS player
☆10Jun 9, 2016Updated 10 years ago
thunlp / Document-Plugin
View on GitHub
Plug-and-Play Document Modules for Pre-trained Models
☆25May 28, 2023Updated 3 years ago
zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
vllm-project / vllm-nccl
View on GitHub
Manages vllm-nccl dependency
☆18Jun 3, 2024Updated 2 years ago
THUKElab / CCL2023-CLTC-THU_KELab
View on GitHub
This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…
☆15Nov 25, 2023Updated 2 years ago
cubenlp / CERRU
View on GitHub
CCL2024 Chinese Essay Rhetoric Recognition and Understanding
☆17Oct 1, 2024Updated last year
RAIVNLab / LLC
View on GitHub
☆13Oct 29, 2021Updated 4 years ago
HugoZHL / MEVI
View on GitHub
[NeurIPS 2023] Model-enhanced Vector Index
☆26May 9, 2024Updated 2 years ago
AndresPMD / Clip_CMR
View on GitHub
CLIP-based simple image-text matching baseline for COCO and F30K
☆15Sep 16, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sfeng-m / Math-Word-Problems-PaperList
View on GitHub
A Paper List for Math Word Problem
☆20Oct 25, 2023Updated 2 years ago
breezedeus / LoveShare
View on GitHub
breezedeus的各种分享
☆22Jan 31, 2023Updated 3 years ago
leolee99 / CLIP_ITM
View on GitHub
A simple pytorch implementation of baseline based-on CLIP for Image-text Matching.
☆19May 25, 2023Updated 3 years ago
sean0042 / Open_WikiTable
View on GitHub
Open-WikiTable :Dataset for Open Domain Question Answering with Complex Reasoning over Table
☆28Jun 2, 2023Updated 3 years ago
chenlian-zhou / ALBERT_NER
View on GitHub
☆34Jul 19, 2020Updated 6 years ago
voidful / ChineseErrorDataset
View on GitHub
CGED & CSC
☆23Feb 27, 2020Updated 6 years ago
ArmelRandy / Self-instruct
View on GitHub
A repository to perform self-instruct with a model on HF Hub
☆32Sep 29, 2023Updated 2 years ago