EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.
☆63Feb 12, 2025Updated last year
Alternatives and similar repositories for edspdf
Users that are interested in edspdf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Confit is a complete and easy-to-use configuration framework aimed at improving the reproducibility of experiments by relying on the Pyth…☆11Jan 21, 2026Updated 2 months ago
- eds-scikit is a Python library providing tools to process and analyse OMOP data☆45Dec 19, 2024Updated last year
- PyTorch extension for handling deeply nested sequences of variable length☆14Mar 19, 2026Updated 3 weeks ago
- Table logger using Rich☆13Aug 13, 2025Updated 7 months ago
- ropensci registry☆13Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Natural language structuring library☆22Jun 5, 2024Updated last year
- Jupyter Widget to display resources used by the kernels☆13Aug 11, 2021Updated 4 years ago
- parse_mediawiki_dump clone☆12Mar 22, 2025Updated last year
- UMLS Graph database for semantic queries☆27Jan 11, 2023Updated 3 years ago
- Python Client Package for WikiPathways☆24Mar 15, 2026Updated 3 weeks ago
- API en GraphQL pour la Base de Données Publique des Médicaments (BDPM)☆21Dec 16, 2022Updated 3 years ago
- Customize 'react-toastify' to integrate nicely in JupyterLab.☆22Feb 3, 2023Updated 3 years ago
- ☆11Apr 15, 2022Updated 3 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆21Aug 15, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 🧪 Cutting-edge experimental spaCy components and features☆105Apr 23, 2024Updated last year
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Terminal UI for monitoring SLURM jobs☆14Mar 29, 2026Updated last week
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- ☆19Mar 8, 2023Updated 3 years ago
- Python package for early warning signals (EWS) of bifurcations in time series data.☆95Updated this week
- Repository of the HBCP project.☆22Jul 25, 2024Updated last year
- An efficient binary serialization format for numerical data.☆18Nov 3, 2025Updated 5 months ago
- A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large …☆15Sep 30, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- a subset of sql dialect for clickhouse db.☆13Mar 25, 2026Updated 2 weeks ago
- Multiple-criteria decision-making (MCDM) with Electre, Promethee, Weighted Sum and Pareto☆17Apr 3, 2022Updated 4 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Intuitive graphical representation of source code☆14Mar 15, 2023Updated 3 years ago
- Generate reports for spaCy models.☆29May 27, 2022Updated 3 years ago
- Implementation of the Tower Method, a novel approach to handling missing values.☆12Mar 12, 2024Updated 2 years ago
- A machine learning tool for fishing entities☆269Feb 27, 2026Updated last month
- Make asyncio great again☆11Feb 3, 2026Updated 2 months ago
- Continual pretraining of foundation LLM using ⚡ Lightning Fabric☆37Nov 27, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Functional interface for concurrent futures, including async coroutines.☆11Mar 31, 2026Updated last week
- An R package for tidyverse-friendly causal inference☆10Aug 2, 2019Updated 6 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Automatically update SUMMARY.md of a GitBook repo,default Based on the markdown title, not the article name,But if without a title, artic…☆11Jul 6, 2022Updated 3 years ago
- Ubiflux Vigor ventilation system RS485 Modbus communications with Python☆12Feb 20, 2026Updated last month
- This program goes through an imgur album and finds all duplicate images.☆12Dec 2, 2019Updated 6 years ago
- A graph query engine☆23Nov 25, 2025Updated 4 months ago