my take at a PDF text extraction utility
☆25 · Jun 15, 2015 · Updated 10 years ago
Alternatives and similar repositories for PDFExtract
Users who are interested in PDFExtract are comparing it to the libraries listed below.
- ☆19 · Sep 5, 2013 · Updated 12 years ago
- PDF Extraction Toolkit ☆42 · Nov 23, 2020 · Updated 5 years ago
- High-level build project for all LAPDF-Text submodules ☆103 · Jul 2, 2015 · Updated 10 years ago
- This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by … ☆28 · Apr 25, 2013 · Updated 12 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆48 · Feb 1, 2023 · Updated 3 years ago
- A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese ☆10 · Dec 9, 2013 · Updated 12 years ago
- ☆12 · Dec 8, 2022 · Updated 3 years ago
- dynamic planning, hybrid models, hierarchical active inference, tool use ☆13 · Jun 13, 2025 · Updated 8 months ago
- "Save as DAISY" add-in for Microsoft Word ☆10 · Dec 22, 2025 · Updated 2 months ago
- Focused Crawler for VT's CTRNet ☆10 · May 13, 2013 · Updated 12 years ago
- (Labeled) Latent Dirichlet Allocation on a sentence level with Gibbs Sampling ☆10 · Mar 27, 2014 · Updated 11 years ago
- Material parsers and other tools and scripts, initially developed for Grobid Superconductor ☆13 · Feb 21, 2025 · Updated last year
- Redis tcp map for postfix ☆12 · Jun 28, 2024 · Updated last year
- Speech ANDroid Apps ☆20 · Jan 22, 2014 · Updated 12 years ago
- Grecka is a python script to convert Greek to Greeklish based on ELOT 743 ☆12 · Aug 4, 2018 · Updated 7 years ago
- Scholarly Big Data Subject Category Classifier ☆10 · Jul 15, 2019 · Updated 6 years ago
- Pulsar-plot of Strava runs in Swift 3 ☆12 · Apr 2, 2018 · Updated 7 years ago
- Madek main web interface ☆21 · Updated this week
- LOC Standards, Schemas, Stylesheets, etc. ☆11 · Sep 30, 2025 · Updated 5 months ago
- A React component for building D3 Chord Diagrams ☆12 · Dec 10, 2022 · Updated 3 years ago
- Automated svn2git mirror of include-what-you-use: link goes to upstream ☆13 · May 27, 2015 · Updated 10 years ago
- Source code for this blog post: http://marceldegraaf.net/2014/05/05/coreos-follow-up-sinatra-logstash-elasticsearch-kibana.html ☆27 · May 4, 2014 · Updated 11 years ago
- An example of centralising clojure/java logging with Logback, LogStash, ElasticSearch, and Kibana ☆17 · Mar 27, 2014 · Updated 11 years ago
- A plug-in architecture for extending Siri virtual assistant ☆29 · Mar 30, 2014 · Updated 11 years ago
- Using SepFormer ☆10 · Feb 2, 2023 · Updated 3 years ago
- Self-improving LLM system using Generator-Reflector-Curator pattern for online learning from execution feedback ☆27 · Feb 17, 2026 · Updated last week
- Python utility to export a user's starred repositories list into a CSV file ☆17 · May 3, 2018 · Updated 7 years ago
- Examples of using Diderot ☆11 · Sep 16, 2019 · Updated 6 years ago
- The Ensemble distributed communications toolkit ☆13 · Jul 26, 2020 · Updated 5 years ago
- Event matching for log records ☆11 · May 12, 2014 · Updated 11 years ago
- Adium plugin for Tox IM protocol ☆14 · Sep 6, 2014 · Updated 11 years ago
- Grapheme to phoneme converter for Estonian ☆14 · May 27, 2021 · Updated 4 years ago
- Human-friendly query language for Elasticsearch ☆23 · Jun 8, 2021 · Updated 4 years ago
- A simple algorithm to find ordered key-value pairs from paddleOCR recognition outputs ☆10 · Mar 1, 2021 · Updated 5 years ago
- Colab notebooks for d2l-book ☆11 · Dec 5, 2019 · Updated 6 years ago
- A Windows DLL call helper ☆14 · Dec 19, 2014 · Updated 11 years ago
- Antivirus engine that allows you to create your own anti-virus ☆11 · Nov 2, 2012 · Updated 13 years ago
- Moana implementation in OCaml ☆16 · Jul 15, 2015 · Updated 10 years ago
- Experimental Redis plugin for Vim ☆13 · Jun 8, 2013 · Updated 12 years ago