My take on a PDF text extraction utility
☆25 Jun 15, 2015 Updated 10 years ago
Alternatives and similar repositories for PDFExtract
Users that are interested in PDFExtract are comparing it to the libraries listed below
- PDF article title extraction tool ☆13 Oct 9, 2015 Updated 10 years ago
- This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by … ☆28 Apr 25, 2013 Updated 12 years ago
- Morphosyntactic tagger for Norwegian bokmål and nynorsk ☆29 Jun 20, 2023 Updated 2 years ago
- High-level build project for all LAPDF-Text submodules ☆103 Jul 2, 2015 Updated 10 years ago
- Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin ☆15 Dec 11, 2018 Updated 7 years ago
- Czech morphological dictionary ☆14 Feb 4, 2019 Updated 7 years ago
- A new solr multilingual index and search architecture, it can support index and search across multiple languages at the same time in the … ☆13 Oct 18, 2019 Updated 6 years ago
- Navi support for ROCm ☆12 Jan 22, 2020 Updated 6 years ago
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆76 Jan 4, 2022 Updated 4 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆48 Feb 1, 2023 Updated 3 years ago
- ☆14 Mar 31, 2015 Updated 10 years ago
- OpenXC iOS framework for use with the C5 BLE device. Also see the openxc-ios-app-demo. ☆14 Feb 26, 2020 Updated 6 years ago
- A library for extracting tables from PDF files ☆92 Aug 2, 2020 Updated 5 years ago
- Moana implementation in OCaml ☆16 Jul 15, 2015 Updated 10 years ago
- High-level Rust library that binds to Poppler to extract text from a PDF ☆11 Dec 16, 2020 Updated 5 years ago
- The Ensemble distributed communications toolkit ☆13 Jul 26, 2020 Updated 5 years ago
- Gamera 3 for Python 2 (deprecated) ☆39 Aug 15, 2022 Updated 3 years ago
- OpenGGSN is a Gateway GPRS Support Node (GGSN). It is used by mobile operators as the interface between the Internet and the rest of the … ☆21 Feb 2, 2011 Updated 15 years ago
- Cloud agnostic resource monitoring and janitor tool ☆21 Aug 22, 2025 Updated 6 months ago
- ☆16 Feb 5, 2014 Updated 12 years ago
- Reasonable Go. ☆10 Aug 13, 2018 Updated 7 years ago
- ☆14 Dec 9, 2022 Updated 3 years ago
- Controlling the Texas Instruments SensorTag (CC2650) from anywhere in the world ☆19 Jan 8, 2016 Updated 10 years ago
- Go Based Lightweight RAG / LLM Tool with CLI + API ☆14 Sep 28, 2023 Updated 2 years ago
- A tool to find all duplicates in large sets of text documents. ☆16 Sep 29, 2021 Updated 4 years ago
- Self-improving LLM system using Generator-Reflector-Curator pattern for online learning from execution feedback ☆27 Mar 6, 2026 Updated 2 weeks ago
- Convert CSV files to Apache Arrow. ☆16 Feb 2, 2023 Updated 3 years ago
- Redis tcp map for postfix ☆12 Jun 28, 2024 Updated last year
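- Scraper for the National Bureau of Statistics of China's five-level (province, city, county, township, village) address data, http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2018/index.html ☆12 Jan 8, 2020 Updated 6 years ago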
- ☆12 Aug 29, 2019 Updated 6 years ago
- Efficient LDA solution on GPUs. ☆24 Aug 20, 2018 Updated 7 years ago
- ☆12 Dec 8, 2022 Updated 3 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery ☆12 Oct 23, 2013 Updated 12 years ago
- Factor Graph Grammars in Python ☆13 Jan 17, 2026 Updated 2 months ago
- An open-source CRF Reference String Parsing Package ☆161 May 6, 2020 Updated 5 years ago
- Compute the most likely permutation of a lattice given an LM ☆10 Jan 3, 2013 Updated 13 years ago
- Colab notebooks for d2l-book ☆11 Dec 5, 2019 Updated 6 years ago
- ☆10 May 30, 2024 Updated last year
- A simple indexing program to quickly search through source code. ☆22 May 19, 2014 Updated 11 years ago