dsidavis / pdftohtml
copy of pdftohtml code with enhancements
☆25Updated last year
Alternatives and similar repositories for pdftohtml
Users who are interested in pdftohtml are comparing it to the libraries listed below.
- PDBF - A Toolkit for Creating Janiform Data Documents☆50Updated 9 years ago
- Python binding to libpoppler with focus on text extraction☆97Updated 3 years ago
- Use visual programming to build data tables based on text data within the Orange data mining software environment☆29Updated 4 months ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Updated 2 years ago
- A visualization tool to support reviewing the scientific literature☆14Updated 7 years ago
- Automatic text summarization☆243Updated 6 years ago
- Zorba - the NoSQL processor☆42Updated last year
- A visual programming language inspired by Scratch☆51Updated 6 years ago
- The open source tools for building, maintaining and deploying Topic Maps-based applications.☆57Updated last month
- KMDoc is a software for an intelligent representation of knowledge useful for quick learning and browsing.☆43Updated 5 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Updated 2 years ago
- MathWebSearch Implementation☆48Updated 2 years ago
- PDF Extraction Toolkit☆42Updated 4 years ago
- PDF.js + Hypothesis viewer / annotator☆398Updated 8 months ago
- Facilitating the global conversation on academic literature☆267Updated 8 years ago
- Visualize your SQLite database schema☆117Updated 2 weeks ago
- A Scrivener 2.2 document that uses MMD 3 and BibDesk to compile into an academic thesis PDF via LaTeX.☆59Updated 13 years ago
- An application for creating digital math worksheets that can be completed by students on their computers.☆70Updated 3 years ago
- A library that helps you to convert from one subtitle format to another☆19Updated 6 years ago
- Plugins by Clement Levallois for Gephi☆34Updated 7 years ago
- ☆138Updated 2 years ago
- ☆37Updated 7 years ago
- 📝 Markdeep☆259Updated 7 years ago
- A framework for creating web-based knowledge maps☆208Updated this week
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- ☆19Updated 7 years ago
- Multilingual handwriting recognition engine for iOS, Android, Windows, Linux, MAC OS X...☆75Updated 3 years ago
- PolyTeX to LaTeX and HTML☆48Updated 4 months ago
- Reproducible Document Archive☆81Updated 6 years ago
- Ocular is a state-of-the-art historical OCR system.☆264Updated last year