Fast and memory-efficient Python PDF Parser based on xpdf sources
☆44Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for pyxpdf
Users that are interested in pyxpdf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- String Distance using cython☆13Jan 19, 2020Updated 6 years ago
- Cython based high performance alternative to Python (re) module for doing basic pattern matching on large data-set..☆11Dec 15, 2022Updated 3 years ago
- Annotation de la jurisprudence des CA Fr☆12May 4, 2018Updated 7 years ago
- ☆11Aug 10, 2021Updated 4 years ago
- Reusable Django application for storing and accessing municipality-related geospatial data☆14Mar 20, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- EPUB Media Overlays javascript implementation☆14Aug 19, 2016Updated 9 years ago
- CyDifflib is a fast implementation of difflib's algorithms, which can be used as a drop-in replacement.☆31Apr 11, 2025Updated 11 months ago
- Per-collection OCR leaderboards using VLM-as-judge☆54Updated this week
- Tools and scripts for working with ELAN☆10Aug 4, 2022Updated 3 years ago
- Bajo los adoquines, la PLAYA 🏖️☆16Feb 17, 2026Updated last month
- Toolkit for training/adapting CMU Sphinx acoustic models☆17May 25, 2018Updated 7 years ago
- ☆17Dec 8, 2024Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- A fast multiple string replace library for ruby. Uses a C implementation of the Aho–Corasick Algorithm based on https://github.com/moreni…☆21Sep 11, 2025Updated 6 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- VB Diarization with Eigenvoice and HMM Priors, refactored☆15Jul 27, 2021Updated 4 years ago
- YACYAML for Cocoa reads and writes YAML, a friendlier, more human, plain text replacement for plists, JSON or NSKeyedArchives.☆48May 31, 2023Updated 2 years ago
- GPU Implementation of "Fast Burrows Wheeler Compression Using All-Cores" IPDSW'15☆15May 7, 2020Updated 5 years ago
- A django application that contains a class for admin interface to render a text field as beautiful Imperavi WYSIWYG editor http://redacto…☆40Jun 14, 2017Updated 8 years ago
- Simplified Objective-C wrappers over various Security.framework and CommonCrypto APIs☆13Aug 25, 2015Updated 10 years ago
- Twitter dataset for Conversational Document Prediction to Assist Customer Care Agents (Ganhotra et al. 2020, EMNLP)☆15Nov 15, 2020Updated 5 years ago
- Extension of ColabTurtle by tolgaatam using classes☆13Mar 19, 2025Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- ☆14Aug 21, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l…☆21Dec 6, 2023Updated 2 years ago
- Site that shows information about the coronavirus from Brazil and the world.☆11Jun 29, 2023Updated 2 years ago
- A minimal template for creating a wxPython GUI application and compiling it into an *.app (for OS X) / *.exe (for MS Windows) with py2app…☆11Sep 24, 2022Updated 3 years ago
- A Python tool to help extracting information from structured PDFs.☆429Mar 16, 2026Updated last week
- Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les tra…☆17Updated this week
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆35Aug 21, 2025Updated 7 months ago
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated 11 months ago
- ORM async for Python☆16Dec 8, 2024Updated last year
- A simple way to keep your Django Model data 'n-sync with N external systems.☆14Mar 6, 2018Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- My SpaceVim configuration. Clone it into ~/.SpaceVim.d☆10Jan 18, 2026Updated 2 months ago
- protected/large media file handler for django ( nginx, apache, caddy ) with development server support (python)☆13Dec 16, 2020Updated 5 years ago
- gosocks is a golang based implementation of a socks5 server which supports custom handlers☆12Mar 10, 2026Updated 2 weeks ago
- Speech annotation web app for regular folk☆22Aug 5, 2016Updated 9 years ago
- Adobe PDFServices Python SDK☆34Jul 10, 2025Updated 8 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Dec 24, 2024Updated last year
- RubyMotion + iOS 7 & up + NSMutableAttributedString☆11May 3, 2015Updated 10 years ago