The simplest way to extract text from PDFs in Python
☆429Jul 7, 2022Updated 3 years ago
Alternatives and similar repositories for slate
Users that are interested in slate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fast and friendly PDF scraping library.☆780Oct 17, 2023Updated 2 years ago
- Python wrapper for xpdf☆19Nov 28, 2019Updated 6 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,295Dec 7, 2022Updated 3 years ago
- extract text from any document. no muss. no fuss.☆4,540Updated this week
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆9,967Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆100Sep 16, 2019Updated 6 years ago
- Circular buffer implementation in Nim☆10Apr 21, 2023Updated 3 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C).☆24Apr 25, 2014Updated 12 years ago
- pdfrw is a pure Python library that reads and writes PDFs☆1,910Apr 29, 2024Updated 2 years ago
- A Generic plug-in system for python applications☆57Mar 29, 2020Updated 6 years ago
- Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame☆2,314Dec 5, 2024Updated last year
- A command-line tool to better visualize crowded dot density maps.☆154Dec 27, 2014Updated 11 years ago
- Simple Bayesian spam rating in Python that is easy to use, small, contained in a single file, and doesn't require any external modules.☆30Mar 11, 2015Updated 11 years ago
- ☆10Jul 22, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Supreme Court prediction model, "version" 2☆50Apr 24, 2017Updated 9 years ago
- Word Graph utility built with NLTK and TextBlob☆18Aug 16, 2013Updated 12 years ago
- A DSL to build Lucene text queries in Python.☆38Jan 5, 2017Updated 9 years ago
- Python Client for Microsoft Project Oxford☆10Jun 7, 2016Updated 9 years ago
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,654Updated this week
- Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.☆1,074Jun 15, 2023Updated 2 years ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆216Dec 3, 2019Updated 6 years ago
- A collection of Fabric utilities largely for Django deployment.☆28Apr 15, 2013Updated 13 years ago