aphp/edspdf

EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule-based or machine-learning-based approaches to classify text blocks as either body or metadata.

☆65

Alternatives and similar repositories for edspdf

Users that are interested in edspdf are comparing it to the libraries listed below.

aphp / eds-scikit
eds-scikit is a Python library providing tools to process and analyse OMOP data
☆45 · Dec 19, 2024 · Updated last year
aphp / Cohort360-FrontEnd
A web application to find patients, build cohorts and visualize health records
☆55 · Updated this week
baatout / ml-in-prod
Tutorial repo for the article "ML in Production"
☆13 · Sep 8, 2018 · Updated 7 years ago
X-DataInitiative / SCALPEL-Flattening
This repository hosts code related to SNDS database flattening
☆16 · Aug 3, 2022 · Updated 3 years ago
koaning / sentence-models
A different, but useful, textcat approach.
☆18 · Jul 15, 2024 · Updated 2 years ago
soda-inria / caussim
Simulations for predictive model selection in causal inference
☆13 · Jan 16, 2025 · Updated last year
Flow-IPC / flow
Flow - Modern C++ toolkit for async loops, logs, config, benchmarking, and more [See also `ipc` repo]
☆14 · Jun 24, 2026 · Updated 3 weeks ago
bcdavasconcelos / citetools
📚 This extension introduces advanced bibliography features to Pandoc and Quarto's Citeproc environment. It bundles several Lua filters t…
☆42 · Dec 21, 2023 · Updated 2 years ago
wjbmattingly / bagpipes-spacy
Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.
☆22 · Aug 15, 2024 · Updated last year
winston0410 / cmd-parser.nvim
A command-line parser for neovim for plugin authors.
☆13 · Feb 23, 2022 · Updated 4 years ago
explosion / spacy-experimental
🧪 Cutting-edge experimental spaCy components and features
☆104 · Apr 23, 2024 · Updated 2 years ago
lintopher0315 / gadget
A no-hassle GVim-inspired GUI text editor built with wxWidgets.
☆15 · Mar 14, 2021 · Updated 5 years ago
Harry-Chan / seq2seqlm-on-qg
☆13 · Feb 9, 2022 · Updated 4 years ago
Abhijit-2592 / spacy-langdetect
A fully customisable language detection pipeline for spaCy
☆93 · May 2, 2019 · Updated 7 years ago
iPieter / llmq
A Scheduler for Batched LLM Inference
☆19 · Oct 5, 2025 · Updated 9 months ago
biozz / sublime-taskfile
A Sublime Text 4 plugin for running Taskfile tasks
☆12 · Sep 30, 2022 · Updated 3 years ago
qanastek / DrBERT
DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains
☆22 · Feb 7, 2024 · Updated 2 years ago
antonvw / wex
wex is a library that offers windows ex and vi components
☆15 · Updated this week
maxwelljohn / FlowTree
Sublime Text plugin that automatically supports your working memory.
☆14 · Apr 7, 2019 · Updated 7 years ago
Manitary / PDF-Metadata-Editor
A GUI to change the metadata of PDF files
☆12 · Oct 3, 2023 · Updated 2 years ago
noartem / elementor
Skia based GUI library
☆17 · Jun 11, 2024 · Updated 2 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆29 · Apr 17, 2024 · Updated 2 years ago
swiss-ai / parity-aware-bpe
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [ACL 2026]
☆19 · Apr 18, 2026 · Updated 3 months ago
yatharthsharma / TwitterBot
TwitterBot - Automatic retweet/reply/like/DM/follow
☆30 · Jul 18, 2018 · Updated 8 years ago
mikesimons / yaml-dsl
☆14 · Apr 21, 2017 · Updated 9 years ago
BartovOleg / prohack-2020
☆13 · Jun 26, 2020 · Updated 6 years ago
nonstd-lite / string-lite
String algorithms for C++11 and later in a single-file header-only library.
☆16 · Feb 4, 2026 · Updated 5 months ago
aaronpeikert / repro-tutorial
☆11 · Mar 11, 2026 · Updated 4 months ago
hdf / Patcher2
Small C# binary file patcher utility. The interesting bit is the byte pattern / mask based search and replace. (Created for educational p…
☆18 · Aug 25, 2014 · Updated 11 years ago
CodeCreator / datatools
Common tools for data processing
☆22 · Dec 8, 2025 · Updated 7 months ago
HumanBehaviourChangeProject / Info-extract
Repository of the HBCP project.
☆23 · Jul 25, 2024 · Updated last year
intfloat / interactive-bert-masked-lm
Simple script for running interactive masked language model with pre-trained BERT models.
☆18 · May 3, 2020 · Updated 6 years ago
lanl / impala
☆10 · Jun 29, 2026 · Updated 3 weeks ago
gbprod / gbvim
my neovim setup
☆11 · Mar 5, 2026 · Updated 4 months ago
qurator-spk / sbb_ner
Named Entity Recognition
☆19 · Feb 13, 2026 · Updated 5 months ago
soda-inria / survival-analysis-benchmark
Exploratory repository to study predictive survival analysis models
☆39 · May 15, 2023 · Updated 3 years ago
adayim / causalMed
Causal Mediation analysis
☆10 · Updated this week
CoderPat / croissant-llm-training
Repository containing the code for training the CroissantLLM
☆21 · Feb 4, 2024 · Updated 2 years ago
DS4PS / ds4ps.github.io
Umbrella website for Data Science for the Public Sector
☆12 · Aug 2, 2024 · Updated last year