kevinboone / epub2txt2
A simple command-line utility for Linux, for extracting text from EPUB documents.
☆185 · Updated this week
Related projects:
- A simple utility to extract text from EPUB documents and, optionally, format it ☆48 · Updated 4 years ago
- convert epub file to txt ☆80 · Updated 4 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆52 · Updated 9 years ago
- 📦 A collection of files for LibriVox recordings to produce ebooks with synchronized text and audio ☆22 · Updated 4 years ago
- Command-line interface for Goldendict dictionaries ☆44 · Updated 9 years ago
- k2pdfopt library for koreader, based on http://willus.com/k2pdfopt ☆89 · Updated last week
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text. ☆22 · Updated 3 years ago
- Convert Wiktionary entries to various formats such as StarDict or DB (MariaDB/MySQL). This used to be the main repository for this projec… ☆15 · Updated 2 years ago
- losslessly convert images to pdf ☆53 · Updated 4 years ago
- ☆297 · Updated last month
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆92 · Updated this week
- XDXF — an open and free dictionary format, that stores word articles in a structural and semantic way. The most convertible format ☆225 · Updated 4 months ago
- Latin language dictionaries ☆34 · Updated 3 years ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST ☆181 · Updated 2 weeks ago
- Charset converter tool and library ☆130 · Updated this week
- hand-written dictionaries from the FreeDict project ☆388 · Updated 10 months ago
- A small framebuffer pdf, djvu, epub, xps, and cbz viewer ☆188 · Updated 2 years ago
- creates ZIM files for Kiwix from arbitrary websites with wget and some nifty tricks (doesn't need ServiceWorkers) ☆68 · Updated 8 months ago
- gdcv - GoldenDict console version and emacs dynamic module ☆29 · Updated last year
- PDF to DjVu converter ☆92 · Updated 8 months ago
- A post-processing tool for scanned sheets of paper. ☆69 · Updated 6 months ago
- Library for manipulating StarDict dictionaries from within Python ☆104 · Updated last year
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code ☆43 · Updated last year
- Data store for Aard 2 ☆238 · Updated 5 months ago
- Tools, documentation, and libraries related to Kobo dictionaries. ☆56 · Updated 2 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt). ☆54 · Updated 6 months ago
- HTML to Markdown converter ☆201 · Updated 7 months ago
- C library for handling Kindle (MOBI) formats of ebook documents ☆419 · Updated 2 months ago
- etm: event and task manager ☆44 · Updated 2 weeks ago
- Client/server software, human language dictionary databases, and tools supporting the DICT protocol (RFC 2229) ☆68 · Updated 4 months ago