Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
☆61Feb 18, 2024Updated 2 years ago
Alternatives and similar repositories for pdfplumber
Users that are interested in pdfplumber are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of multi-vector retrieval resources☆18May 29, 2024Updated last year
- Easy-to-use and Fast NLP library with awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications.☆12Mar 13, 2024Updated 2 years ago
- This repository contains statistics about the AI Infrastructure products.☆17Feb 27, 2025Updated last year
- ☆12Oct 1, 2025Updated 6 months ago
- Reproducible research and reusable acyclic workflows in Python. Execute code on HPC systems as if you executed them on your personal comp…☆18Jan 11, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- PaddleSeq☆10Mar 28, 2023Updated 3 years ago
- This pipeline is used to distinguish allotetraploid subgenomes.☆11Apr 8, 2024Updated 2 years ago
- This repository contains the code of metric indexing for exact similarity search.☆12Jul 11, 2023Updated 2 years ago
- Using NLP techniques to summarize prompts for program synthesis☆17Sep 26, 2023Updated 2 years ago
- A repository for organizing our submission to the MEDIQA-Chat Tasks @ ACL-ClinicalNLP 2023☆22Jul 21, 2023Updated 2 years ago
- Backend server for envd☆21Dec 18, 2023Updated 2 years ago
- Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and make Chatbot Question answering (QA) with LoRA…☆13Jan 20, 2024Updated 2 years ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Codes for ocean-of-memories article series☆22Sep 2, 2015Updated 10 years ago
- Code for EMNLP 2021 Paper "Recall and Learn: A Memory-augmented Solver for Math Word Problems".☆16Oct 20, 2022Updated 3 years ago
- ☆10May 27, 2024Updated last year
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Aug 4, 2022Updated 3 years ago
- Grammar correct project based Tencent's paper(Sequence to Action)☆15Sep 8, 2022Updated 3 years ago
- 3位代码类目表;6位扩展代码表;疾病分类与代码(修订版);章节名称及代码☆11Aug 20, 2018Updated 7 years ago
- python implementation of Incremental Hierarchical Agglomerative Clustering (IHAC)☆16Apr 8, 2015Updated 11 years ago
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆19Apr 13, 2023Updated 3 years ago
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆14Mar 7, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CoalHMM☆22Nov 4, 2013Updated 12 years ago
- [VLDB 2024] Source code for FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data☆15Mar 11, 2025Updated last year
- Source code for Pivot Selection Algorithms in Metric Spaces: An Experimental Evaluation. VLDBJ 2021.☆15Jul 27, 2021Updated 4 years ago
- 同花顺算法挑战平台:【9-10双月赛】跨领域迁移的文本语义匹配☆11Oct 28, 2021Updated 4 years ago
- ☆16May 4, 2021Updated 4 years ago
- Modern Data Engineering Project☆12Jun 3, 2022Updated 3 years ago
- ☆13Nov 29, 2022Updated 3 years ago
- auto scrawl for arrive data☆16Jan 24, 2022Updated 4 years ago
- Some commonly used functions and modules☆10Jan 15, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Plug-and-Play Document Modules for Pre-trained Models☆25May 28, 2023Updated 2 years ago
- Multifastats: Multi-Fasta Sequence Stats. Free python-based program that, from a set of a set of 'fasta' sequences (as group or individua…☆10Feb 21, 2018Updated 8 years ago
- Summaries of ICML 2024 papers☆12Jul 31, 2024Updated last year
- My WWDC17 scholarship winning playground☆13Feb 14, 2019Updated 7 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- Manages vllm-nccl dependency☆17Jun 3, 2024Updated last year
- Quartet-based species tree and tree of blob estimation☆16Feb 10, 2026Updated 2 months ago