my take at a PDF text extraction utility
☆15Jun 15, 2015Updated 10 years ago
Alternatives and similar repositories for PDFExtract
Users that are interested in PDFExtract are comparing it to the libraries listed below
Sorting:
- ☆14Jan 2, 2024Updated 2 years ago
- Words -> Phrases; NLP☆11Apr 8, 2016Updated 9 years ago
- Example programs that demonstrate using the odmlib Python package for working with the CDISC ODM standard☆13Mar 10, 2026Updated last week
- A Cassandra schema change management tool for applications running on the JVM☆14Apr 19, 2018Updated 7 years ago
- name entity recognition with recurrent neural network(RNN) in tensorflow☆16Feb 9, 2022Updated 4 years ago
- A CLI app for taking simple notes without ever leaving the terminal.☆12Jan 7, 2019Updated 7 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- flexible VersionNumber parsing in Julia☆14May 16, 2023Updated 2 years ago
- all your base are belong to me☆15Feb 3, 2021Updated 5 years ago
- A minimal web API rendering SMILES molecules☆19May 29, 2019Updated 6 years ago
- texrex web page cleaning & ClaraX random walk crawler☆11Dec 13, 2021Updated 4 years ago
- IPv4 / IPv6 network abstractions for Julia☆13Jan 14, 2026Updated 2 months ago
- ☆12Mar 3, 2026Updated 2 weeks ago
- ODK Validate is a Java application for confirming that a form is valid and compliant with the ODK XForms specification. Contribute and ma…☆12Jan 8, 2026Updated 2 months ago
- Python package for working with CDISC ODM☆25Mar 2, 2026Updated 2 weeks ago
- Create HTML extensions for Adobe Creative Cloud products [CEP 8] for Brackets☆11Jun 29, 2019Updated 6 years ago
- LocalAI website, powered by Hugo☆15Nov 22, 2023Updated 2 years ago
- ☆13Feb 12, 2023Updated 3 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago
- Sequence Labeling Parsing by Learning Across Representations☆13Oct 3, 2019Updated 6 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser☆11Oct 14, 2016Updated 9 years ago
- Computer Vision tutorial for DH Summer School Antwerp☆11May 25, 2023Updated 2 years ago
- An optimized prime sieve in Julia☆14Dec 10, 2024Updated last year
- A package for fast evaluation of multivariate polynomials.☆12Oct 15, 2021Updated 4 years ago
- Utility class for working with multiple screens in Cocoa☆19Sep 5, 2011Updated 14 years ago
- A C4D Plugin that enables to recieve FaceShift-Animationdata per TCP/IP Stream.☆16Jan 21, 2016Updated 10 years ago
- UniParse: A universal graph-based parsing toolkit☆10Oct 2, 2019Updated 6 years ago
- nginx reverse proxy vs go for ssl termination☆15Nov 30, 2016Updated 9 years ago
- Dependency Parsing as Sequence Labeling with Python3+ and PyTorch1+ and MTL☆10Nov 21, 2019Updated 6 years ago
- ☆10Jul 3, 2019Updated 6 years ago
- A python web application in a single binary☆14Nov 14, 2023Updated 2 years ago
- A rust library for extracting content from pdfs☆573Feb 23, 2026Updated 3 weeks ago
- A trading (matching) engine implementation in Rust.☆51Oct 31, 2022Updated 3 years ago
- Pipeline for the production of digital scholarly editions of archival collections☆14Feb 22, 2024Updated 2 years ago
- Save examples from dictionaries to Google Sheet in one click.☆12Apr 24, 2023Updated 2 years ago
- Misc resources for Dotsies - a font that uses dots instead of letters.☆15Apr 3, 2012Updated 13 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- Tools for working with Iterators of Iterators of ...., with particular application in NLP which has Corpus made up of Document made up of…☆13Aug 27, 2021Updated 4 years ago