bitextor/pdf-extract

PDF parser and converter to HTML

☆94

Alternatives and similar repositories for pdf-extract

Users who are interested in pdf-extract are comparing it to the libraries listed below.

bitextor / bifixer
Tool to fix bitexts and tag near-duplicates for removal
☆35 · Sep 4, 2025 · Updated 9 months ago
Alexmhack / Django-Rasa-Sockets
Rasa Chatbot using Django backend and Sockets for communication
☆12 · Dec 8, 2022 · Updated 3 years ago
thephpleague / flysystem-gridfs
GridFS Adapter for Flysystem
☆20 · Jan 23, 2026 · Updated 5 months ago
paracrawl / keops
Tool for manual evaluation of parallel sentences.
☆15 · Jan 26, 2026 · Updated 5 months ago
DrSnowbird / blazegraph-docker
Blazegraph docker container for deploying to Container Cluster Platforms (OpenShift, Kubernetes, etc)
☆15 · Feb 6, 2023 · Updated 3 years ago
fghazaleh / multi-thread-manager
Multi Threading Manager using PHP Symfony process component
☆16 · Mar 24, 2023 · Updated 3 years ago
DrSnowbird / knime-vnc-docker
KNIME Analytics Platform in Docker with VNC for Kubernetes, Openshift, DC/OS, Container Cloud Platforms
☆11 · Feb 26, 2022 · Updated 4 years ago
VisualDataWeb / OntoBench
A modular generator for OWL ontologies.
☆14 · Jul 1, 2016 · Updated 9 years ago
caresteouvert / Covid_enseignes
Chain stores and services open during Covid-19 lockdown
☆16 · May 9, 2023 · Updated 3 years ago
mattoopie / fold
Advanced fold methods for Kotlin
☆12 · May 1, 2026 · Updated last month
shdev / phpflashtext
Extract Keywords from sentence or Replace keywords in sentences. @ https://github.com/vi3k6i5/flashtext
☆20 · Jul 22, 2019 · Updated 6 years ago
minad / osm2shp
Convert large OpenStreetMap files to shapefiles (Uses sqlite3 db as temporary storage)
☆33 · Jan 15, 2024 · Updated 2 years ago
Shivam-Miglani / contextual_drl
Extracting action sequences and generating domain models.
☆15 · Dec 18, 2022 · Updated 3 years ago
DrSnowbird / mysql-workbench
mysql-workbench
☆15 · Nov 11, 2018 · Updated 7 years ago
InvirganceOpenSource / convirgance
A core library for reading, transforming, filtering, and writing data records
☆15 · Apr 29, 2026 · Updated 2 months ago
aboullaite / covid-19-picocli
Covid-19 dashboard built using picocli
☆12 · May 11, 2020 · Updated 6 years ago
yyz1989 / NoSPA-RDF-Data-Cube-Validator
[0.9.9 Released] A high performance non-SPARQL based RDF data cube validator
☆16 · Mar 11, 2016 · Updated 10 years ago
DrSnowbird / docker-spark-bde2020-zeppelin
Zeppelin docker
☆16 · Nov 16, 2020 · Updated 5 years ago
SOLR4189 / solcolator
Implementation Saved Searches a la ElasticSearch Percolator
☆12 · May 20, 2022 · Updated 4 years ago
vhyza / lemmagen-lexicons
Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin
☆15 · Dec 11, 2018 · Updated 7 years ago
weso / rdfshape-client
Web client for RDFShape API with human-friendly validations and visualizations.
☆11 · Apr 23, 2024 · Updated 2 years ago
smalldirector / solr-multilingual-analyzer
A new solr multilingual index and search architecture, it can support index and search across multiple languages at the same time in the …
☆13 · Oct 18, 2019 · Updated 6 years ago
nrv / pimmi
Python IMage MIning
☆15 · Mar 19, 2025 · Updated last year
SolidBench / SolidBench.js
A benchmark for Solid to simulate vaults with social network data.
☆11 · May 14, 2026 · Updated last month
eduardoarandah / autohttptests
Laravel http tests generator. No more writing tests by hand
☆34 · Mar 24, 2021 · Updated 5 years ago
aws-samples / lambda-efs-deep-learning-inference
Deep Learning inference with AWS Lambda and Amazon EFS
☆14 · Aug 24, 2020 · Updated 5 years ago
etalab-ia / ocr-xtract
☆16 · Jun 22, 2022 · Updated 4 years ago
Fraunhofer-IESE / badgers
Badgers: Bad Data Generators
☆15 · Updated this week
datagouv / cada.data.gouv.fr
A simple interface to search and display CADA advices
☆18 · May 15, 2025 · Updated last year
comunica / sparqlee
⚙️ SPARQL expression evaluator library - Moved to @comunica/expression-evaluator
☆15 · Sep 19, 2023 · Updated 2 years ago
giltesa / GBA-Wireless-Gamepad
Turn your Game Boy Advance into a Bluetooth Gamepad.
☆18 · May 2, 2026 · Updated last month
amolkokje / elk-docker
☆34 · Aug 26, 2019 · Updated 6 years ago
entrepreneur-interet-general / the-magical-csv-merge-machine
API and interface for CSV normalization and linking
☆14 · May 15, 2018 · Updated 8 years ago
TREEcg / event-stream-client
Deprecated! Use the rdf-connect/ldes-client instead
☆14 · Mar 5, 2024 · Updated 2 years ago
Drunkar / tensor2tensor-optuna
Hyperparameter tuning with Optuna integrated tensor2tensor.
☆10 · Oct 7, 2020 · Updated 5 years ago
Elopteryx / bean-mirror
Modern reflection library.
☆14 · Jun 2, 2024 · Updated 2 years ago
soaxelbrooke / phrase
A tool for learning significant phrase/term models, and efficiently labeling with them.
☆34 · Apr 23, 2025 · Updated last year
ArdalanM / pyDD
☆11 · Oct 10, 2017 · Updated 8 years ago
dreamnotover / oqmrc2018
Opinion-type question reading comprehension, challenger.ai
☆10 · Nov 14, 2018 · Updated 7 years ago