kevinboone / epub2txt2
A simple command-line utility for Linux, for extracting text from EPUB documents.
☆193Updated last month
Related projects ⓘ
Alternatives and complementary repositories for epub2txt2
- A simple utility to extract text from EPUB documents and, optionally, format it☆48Updated 4 years ago
- convert epub file to txt☆83Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆94Updated this week
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- 🏆 • 5050 most frequent words in 109 languages☆35Updated last year
- k2pdfopt library for koreader, based on http://willus.com/k2pdfopt☆93Updated last week
- Convert Wiktionary entries to various formats such as StarDict or DB (MariaDB/MySQL). This used to be the main repository for this projec…☆15Updated 2 years ago
- IPA Pronunciation Dictionaries in DSL format☆39Updated 7 years ago
- Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF☆43Updated 6 months ago
- Command-line interface for Goldendict dictionaries☆45Updated 9 years ago
- Monolingual wordlists with pronunciation information in IPA☆558Updated last year
- XDXF — an open and free dictionary format, that stores word articles in a structural and semantic way. The most convertible format☆227Updated 6 months ago
- Base framework offering a Lua scriptable environment for creating document readers☆138Updated this week
- The Language Learning Toolkit (LLTK) performs a variety of tasks useful for (human) language learning.☆41Updated 5 years ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆135Updated 2 weeks ago
- 📦 A collection of files for LibriVox recordings to produce ebooks with synchronized text and audio☆24Updated 4 years ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆22Updated 4 years ago
- 📈 A forced aligner intended for synchronization of narrated text☆85Updated last year
- Convert epub file to txt☆24Updated last year
- Mediawiki scraper: all your wiki articles in one highly compressed ZIM file☆292Updated this week
- Data store for Aard 2☆241Updated 7 months ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆187Updated 2 months ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆44Updated 3 weeks ago
- British English pronunciation dictionary☆89Updated 7 years ago
- PDF to DjVu converter☆94Updated 10 months ago
- Data for the International Phonetic Alphabet (IPA)☆26Updated last year
- This is the KOReader CREngine fork. It cross-pollinates with the official CoolReader repository at https://github.com/buggins/coolreader,…☆72Updated last week
- Library for manipulating StarDict dictionaries from within Python☆104Updated last year
- Graduated Interval Recall program☆20Updated 4 months ago
- Chinese font displaying Hanzi (汉字) characters with by transliteration/pronunciation (Pīnyīn).☆139Updated 7 years ago