dsidavis / pdftohtml
copy of pdftohtml code with enhancements
☆25 Updated 2 years ago
Alternatives and similar repositories for pdftohtml
Users who are interested in pdftohtml are comparing it to the repositories listed below
- PDBF - A Toolkit for Creating Janiform Data Documents ☆50 Updated 9 years ago
- Python binding to libpoppler with focus on text extraction ☆97 Updated 4 years ago
- Zorba - the NoSQL processor ☆42 Updated 2 years ago
- Facilitating the global conversation on academic literature ☆267 Updated 8 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆48 Updated 2 years ago
- a visual programming language inspired by Scratch ☆51 Updated 6 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com… ☆134 Updated 2 months ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p… ☆22 Updated 10 years ago
- A Lua custom writer for Pandoc generating JATS XML ☆76 Updated 7 years ago
- 📝 Markdeep ☆260 Updated 7 years ago
- Ocular is a state-of-the-art historical OCR system. ☆265 Updated last year
- An active essay revisiting Gabriel Groner's GRAIL handwriting recognizer from the 1960s: ☆143 Updated 5 years ago
- An implementation of miller columns with jQuery ☆51 Updated 13 years ago
- Markdown + Tangle.js + (someday) SymPy ☆31 Updated 3 years ago
- A command-line tool for interacting with books in git ☆112 Updated last year
- Read natural language interactive queries. Great for bots. ☆18 Updated 9 years ago
- Select elements from large XML files, fast. ☆54 Updated 2 weeks ago
- Deutsch Language Tool Kit ☆12 Updated 10 years ago
- Strips boilerplate from Project Gutenberg text files ☆18 Updated 4 years ago
- Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML ☆37 Updated last year
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an… ☆19 Updated 3 weeks ago
- The open source tools for building, maintaining and deploying Topic Maps-based applications. ☆57 Updated 2 weeks ago
- Pandoc filter to include CSV data (from file or URL) ☆40 Updated 5 years ago
- Fiction publishing platform ☆28 Updated 7 years ago
- Markdown editor for scientific writing. Batteries included. ☆325 Updated 10 years ago
- Edit Textbooks using Javascript and save to GitHub ☆103 Updated 7 years ago
- ☆19 Updated 8 years ago
- An example of PEG usage ☆55 Updated 9 years ago
- We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components… ☆109 Updated 6 years ago
- Write your project report/academic paper in Markdown and convert it with ease to the ACM format ☆44 Updated 8 years ago