dsidavis / pdftohtml
copy of pdftohtml code with enhancements
☆25 Updated 2 years ago
Alternatives and similar repositories for pdftohtml
Users who are interested in pdftohtml are comparing it to the libraries listed below.
- Python binding to libpoppler with focus on text extraction ☆97 Updated 4 years ago
- PolyTeX to LaTeX and HTML ☆48 Updated 7 months ago
- PDF Extraction Toolkit ☆42 Updated 5 years ago
- PDBF - A Toolkit for Creating Janiform Data Documents ☆50 Updated 9 years ago
- Archive of monolithic GF repository until 2018-07-25 ☆189 Updated 7 years ago
- A library for extracting tables from PDF files ☆89 Updated 12 years ago
- A Prolog implementation based on generators ☆20 Updated 11 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com… ☆134 Updated last month
- A Lua custom writer for Pandoc generating JATS XML ☆76 Updated 7 years ago
- Automatically refresh Pandoc documents in your web browser ☆50 Updated 8 years ago
- Edit Textbooks using Javascript and save to GitHub ☆103 Updated 7 years ago
- Ocular is a state-of-the-art historical OCR system. ☆265 Updated last year
- Offline storage for the Annotator ☆43 Updated 8 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆56 Updated 4 years ago
- A library that helps you to convert from one subtitle format to another ☆19 Updated 6 years ago
- A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools… ☆296 Updated last week
- We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components… ☆109 Updated 6 years ago
- Released Apertium translation pairs ☆30 Updated 4 years ago
- PDF Command Line Tools Source ☆266 Updated this week
- ☆92 Updated last year
- Manifests of the public domain images uploaded to Flickr Commons, with descriptive information about the books they were taken from. ☆75 Updated 11 years ago
- Automatic text summarization ☆244 Updated 7 years ago
- Archive.org OPDS Bookserver - A standard for digital book distribution ☆130 Updated 7 years ago
- Chart parser (Earley SPPF) ☆27 Updated 7 years ago
- Write your project report/academic paper in Markdown and convert it with ease to the ACM format ☆44 Updated 8 years ago
- Use visual programming to build data tables based on text data within the Orange data mining software environment ☆30 Updated 2 months ago
- An active essay revisiting Gabriel Groner's GRAIL handwriting recognizer from the 1960s: ☆143 Updated 5 years ago
- ☆39 Updated 10 years ago
- Toki Pona Visual Dictionary with English, Italian and Russian translation in pictures ☆32 Updated 3 years ago
- ☆19 Updated 8 years ago