Python interface to Apache PDFBox command-line tools.
☆79Jan 24, 2023Updated 3 years ago
Alternatives and similar repositories for python-pdfbox
Users that are interested in python-pdfbox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python wrapper for xpdf☆19Nov 28, 2019Updated 6 years ago
- ☆11Oct 22, 2018Updated 7 years ago
- ☆61Jan 28, 2026Updated 2 months ago
- Python library for GeneiaTagger☆10May 7, 2015Updated 10 years ago
- Random Forest-based "Correlation" measures☆15May 3, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A tool to expand abbreviations detected within a string. Designed for scientific writing.☆13Oct 6, 2017Updated 8 years ago
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.☆458Aug 3, 2023Updated 2 years ago
- Screen scrapers relating to natural disasters. See their output in https://github.com/simonw/disaster-data/☆11May 22, 2023Updated 2 years ago
- minimal examples of brat annotation visualizations☆17Jan 21, 2015Updated 11 years ago
- Complex Systems 530 - Computer Modeling of Complex Systems (Winter 2016)☆15Apr 15, 2016Updated 10 years ago
- Demo of using Airflow☆11Jun 24, 2022Updated 3 years ago
- CS231n Convolutional Neural Networks for Visual Recognition☆12Aug 17, 2021Updated 4 years ago
- Code example for pretraining an LLM with vanilla PyTorch training loop☆10Jun 6, 2024Updated last year
- ☆20Nov 17, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Utility to re-structure research papers published in US Letter or A4 format PDF files to typically remove the 2 columns layout.☆53Nov 8, 2010Updated 15 years ago
- Easy OCR demo + Invoice for Youtube☆11Jul 15, 2020Updated 5 years ago
- A repository to host code for participation in the 2021 #30DayMapChallenge☆14Nov 20, 2021Updated 4 years ago
- ScienceBeam Gym☆25Feb 19, 2026Updated 2 months ago
- Deep Unsupervised Learning Course Tracking☆10Oct 23, 2020Updated 5 years ago
- The current version of Data by Design, an interactive history of data visualization☆14Updated this week
- A CLI utility written in Python to help you count files, grouped by extension, in a directory. By default, it will count files recursivel…☆23Mar 17, 2026Updated last month
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 3 months ago
- Master Documentation Repository for OHNLP Projects Related to Coronavirus Disease 2019☆21Dec 8, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MAE is a lightweight, general-purpose natural language annotation tool☆63Oct 31, 2025Updated 5 months ago
- Parsing pdf tables using YOLOV3☆121Mar 15, 2021Updated 5 years ago
- A point file of populated places in Great Britain (from Ordnance Survey open data)☆14May 14, 2022Updated 3 years ago
- Automated Damage Assessment using Deep Learning☆14Jun 25, 2025Updated 9 months ago
- The Object-Oriented-Programming (OOP) version of the "Coffee Machine Project" from Dr. Angela Yu's Python Bootcamp (London App Brewery)☆16Jan 7, 2023Updated 3 years ago
- Time Extractor NLP project - locate dates and times in text documents☆23Oct 18, 2022Updated 3 years ago
- ☆16May 2, 2025Updated 11 months ago
- Python API for PDF documents☆124Sep 5, 2024Updated last year
- Tập dữ liệu câu hỏi về người trong tiếng Việt đã được gán nhãn☆16Jul 30, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code and documentation for analytical work on OCHA Anticipatory Action pilots.☆17Sep 20, 2024Updated last year
- An easy-to-use point-and-click geocoder 🌍📍☆15Jan 6, 2023Updated 3 years ago
- Code for our NeurIPS 2023 paper Towards Evaluating Transfer-based Attacks Systematically, Practically, and Fairly☆14Jan 22, 2024Updated 2 years ago
- An opensource TAR framework for experiments and applications☆18Mar 18, 2024Updated 2 years ago
- Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)☆14Nov 20, 2024Updated last year
- Examples for using the Pipl SEARCH API☆11Dec 19, 2023Updated 2 years ago
- Python manager for spark-submit jobs☆10Jan 6, 2024Updated 2 years ago