PDF Extraction Toolkit
☆42 · Nov 23, 2020 · Updated 5 years ago
Alternatives and similar repositories for pdfxtk
Users who are interested in pdfxtk compare it to the libraries listed below.
- Java command-line tools for comparing results to ground truth for table location and structure detection as used in the ICDAR 2013 Table … ☆33 · May 31, 2020 · Updated 5 years ago
- Computer Vision Segmentation for Document Layout Analysis ☆10 · Sep 26, 2022 · Updated 3 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆69 · Nov 7, 2020 · Updated 5 years ago
- Functional and structural analysis of tables in research papers (Table disentangling) ☆20 · Aug 7, 2017 · Updated 8 years ago
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆76 · Jan 4, 2022 · Updated 4 years ago
- HNSW implemented in Python ☆21 · Nov 28, 2019 · Updated 6 years ago
- Document Layout Analysis Projects ☆23 · Sep 4, 2019 · Updated 6 years ago
- Clone of https://code.google.com/p/splitta/ so it can be a git submodule ☆34 · Jun 11, 2013 · Updated 12 years ago
- Gust is a set of GPU extensions for Breeze. ☆32 · Apr 10, 2015 · Updated 10 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden… ☆26 · Dec 31, 2020 · Updated 5 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 · Mar 4, 2022 · Updated 4 years ago
- N-Gram Weighting Scheme ☆36 · Jul 19, 2017 · Updated 8 years ago
- ☆12 · Dec 28, 2021 · Updated 4 years ago
- Scholarly Big Data Subject Category Classifier ☆10 · Jul 15, 2019 · Updated 6 years ago
- Python Client Library for FROST. ☆11 · Sep 19, 2025 · Updated 5 months ago
- Build and run container environment for LFRic ☆10 · Jan 8, 2024 · Updated 2 years ago
- ☆15 · Updated this week
- A set of visualization engines. ☆14 · Updated this week
- PoseNet integration for Node for Max. ☆14 · Nov 23, 2019 · Updated 6 years ago
- Chromium extension to send a page to an e-mail recipient. ☆13 · Aug 28, 2024 · Updated last year
- Material parsers and other tools and scripts, initially developed for Grobid Superconductor ☆13 · Feb 21, 2025 · Updated last year
- Tabula Rasa Tic-Tac-Toe ☆10 · Jan 3, 2019 · Updated 7 years ago
- Python utility to listen, timestamp and log data received from a serial port. ☆11 · Aug 28, 2023 · Updated 2 years ago
- Framework for information extraction from tables ☆40 · Apr 15, 2019 · Updated 6 years ago
- A JupyterHub authenticator using Kerberos ☆12 · Jul 16, 2019 · Updated 6 years ago
- Webpage for Unibeautifier ☆10 · Feb 23, 2026 · Updated last week
- lichess game download link creator ☆10 · Apr 3, 2020 · Updated 5 years ago
- Home is where the dotfiles are. ☆11 · Apr 29, 2025 · Updated 10 months ago
- Semi-automated process to create an audiobook (m4b format) from markdown files. ☆11 · Jan 12, 2017 · Updated 9 years ago
- Colab notebooks for d2l-book ☆11 · Dec 5, 2019 · Updated 6 years ago
- Curated list of CLI tools and plugins that help you use AI in Vim, Neovim, and the Terminal. ☆23 · Updated this week
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆12 · Aug 10, 2023 · Updated 2 years ago
- Create P2P apps between browsers ☆13 · Dec 30, 2022 · Updated 3 years ago
- LuaJIT FFI bindings to jq ☆12 · Jun 6, 2025 · Updated 9 months ago
- Solver in the low-rank tensor train format with cross approximation approach for the multidimensional Fokker-Planck equation ☆14 · Oct 24, 2023 · Updated 2 years ago
- A proof of concept project for testing the modifications to the PS4 UI. ☆11 · May 11, 2021 · Updated 4 years ago
- Generate zsh completion functions from manpage or `--help` ☆10 · Mar 18, 2020 · Updated 5 years ago
- Epub reader with speech synthesis and dictionaries for Windows ☆10 · Aug 14, 2024 · Updated last year
- 🍒 Dynamically inline assets into the DOM using Fetch Injection. Mirror of Fetch Inject on Codeberg. ☆13 · May 26, 2024 · Updated last year