This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of text extraction from PDF files.
☆38Feb 3, 2025Updated last year
Alternatives and similar repositories for pdf-text-extraction
Users that are interested in pdf-text-extraction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-an…☆20Feb 3, 2025Updated last year
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆1,105Apr 2, 2026Updated last week
- ☆11Sep 19, 2019Updated 6 years ago
- Provides classes for working with sets☆11Dec 19, 2025Updated 3 months ago
- Taskwarrior tasks reviewing script☆14May 25, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An open source deep research clone. AI Agent (Local LLM or Gemini) that reasons large amounts of web data extracted with SwiftSoup.☆13Feb 10, 2025Updated last year
- A textual TUI for Prodigy☆16Jun 8, 2023Updated 2 years ago
- R Code + R Notebook on how to process and visualize the official IMDb datasets.☆12Jul 16, 2018Updated 7 years ago
- Stores email header and body information in JSON format☆12Mar 10, 2016Updated 10 years ago
- A Swift package for interacting with selenium and undetected-chromedriver through python by using PythonKit.☆13Jun 21, 2025Updated 9 months ago
- Swift framework and tool to launch and control Jupyter Notebooks.☆15Sep 18, 2025Updated 6 months ago
- 🧰 📰 Tools to Work with the 'Feedly' 'API'☆18Jan 22, 2020Updated 6 years ago
- A ggplot interface for creating clincial trial swimlanes☆16Aug 2, 2022Updated 3 years ago
- R package for Extended Date/Time Format (EDTF)☆16Jun 2, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- parallel execution of RSelenium☆14Apr 22, 2025Updated 11 months ago
- ☆12Dec 4, 2024Updated last year
- R access to high resolution raster maps using the OpenStreetMap protocol. Dozens of road, satellite, and topographic map servers are dire…☆14Sep 29, 2025Updated 6 months ago
- FiscalSim US is a microsimulation model of the US federal and state tax and benefit system relating to households and individuals.☆11Dec 20, 2024Updated last year
- Turn browser clicks into reproducible scraping code.☆11Oct 27, 2024Updated last year
- A hierarchical tree structure for Swift☆18Dec 28, 2024Updated last year
- Materials for the Learn Julia with Us workshop series☆12Jul 7, 2022Updated 3 years ago
- MacOS, Linux and Windows Clipboard Management App☆11Mar 10, 2025Updated last year
- r2Symbols : Direct insertion of over 1000 HTML symbol entities in Rmarkdown, Quarto and Shiny Applications☆10Mar 17, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Visual tools to help machine learning model selection☆15Jun 11, 2021Updated 4 years ago
- Replication Package for the Book Non-Democratic Politics by Xavier Marquez☆14May 21, 2023Updated 2 years ago
- A Framework for Web API Packages☆16Updated this week
- ⛔ ARCHIVED ⛔ Importing and Analyzing Twitter Data Collected with Twitter Archiving Google Sheets☆13Jan 23, 2024Updated 2 years ago
- ☆21Aug 26, 2025Updated 7 months ago
- MacOS Finder Sync Extension to Allow Adding Custom Actions☆14Feb 15, 2022Updated 4 years ago
- ☆12Jan 20, 2026Updated 2 months ago
- Lots of plots, various labeling, axis and color scaling functions.☆13Feb 13, 2026Updated last month
- ☆14Jan 11, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- An R package for checking R packages and R code☆21Mar 6, 2026Updated last month
- Building a Chain of Thought RAG Model with DSPy, Qdrant and Ollama☆35Mar 22, 2024Updated 2 years ago
- A Swift version of Marvis TTS, running locally on Apple Silicon using MLX Swift.☆22Jan 4, 2026Updated 3 months ago
- Robust speech recognition on-device with CoreML and Swift for iOS and macOS applications.☆11Feb 21, 2024Updated 2 years ago
- How to build a map view based app with SwiftUI☆11Jul 29, 2019Updated 6 years ago
- A VS Code extension for document and book proofreading based on LLM services☆19Updated this week
- Xcode MCP Server xcf is a 100% Swift based allowing you to integrate Xcode with your favorite AI IDE or MCP Client☆12Mar 30, 2026Updated last week