danigm / poppler
Personal clone of Poppler, official repository is here: https://gitlab.freedesktop.org/poppler/poppler
☆130Updated 6 years ago
Alternatives and similar repositories for poppler:
Users that are interested in poppler are comparing it to the libraries listed below
- This is not the poppler repository. Please see https://poppler.freedesktop.org/☆54Updated 15 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated 2 years ago
- PDFEdit is a free PDF editor.☆91Updated 13 years ago
- PoDoFo is a library to work with the PDF file format. The name comes from the first letter of PDF (Portable Document Format). A few tools…☆52Updated 10 years ago
- ☆422Updated 10 years ago
- A Lingoes dictionary file (LD2/LDX) reader/extractor. Written in C++ with Qt☆77Updated 10 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.☆130Updated 7 years ago
- ☆76Updated 10 years ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆214Updated 5 years ago
- Extremely Naive Charset Analyser☆285Updated 6 months ago
- CMap Resources☆270Updated last year
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 9 years ago
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆45Updated 11 years ago
- Python bindings for cld3☆27Updated last year
- WMF to SVG Converting Tool & Library for Java☆85Updated last year
- python module reading the StarDict dictionaries☆45Updated last year
- CoNLL-U format library for JavaScript☆72Updated 8 years ago
- A library for extracting tables from PDF files☆90Updated 11 years ago
- Open source handwriting recognition keyboard written in QML/JavaScript☆84Updated 9 years ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 8 years ago
- Lexical database of any language☆179Updated 2 years ago
- PDFium library without V8 JavaScript engine - compiles under Linux, Mac and Windows☆62Updated 9 years ago
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆160Updated 7 years ago
- Homebrew tap with some KDE packages. For now contains KDevelop and Kate☆106Updated 7 years ago
- [DEPRECATED - please use rups instead] RUPS is an abbreviation for Reading and Updating PDF Syntax. RUPS is a tool built on top of iText®…☆110Updated 6 years ago
- Python binding to libpoppler with focus on text extraction☆97Updated 3 years ago
- Python 3 port of pdfminer☆188Updated 6 years ago
- A toolbox for working with the Chinese language in Python☆150Updated 5 years ago
- Library for manipulating StarDict dictionaries from within Python☆104Updated last year