kevinxiong / epub2txt
Convert EPUB files to plain text (.txt)
☆87 Updated 5 years ago
Alternatives and similar repositories for epub2txt:
Users who are interested in epub2txt are comparing it to the libraries listed below.
- Python-based software to unpack KindleGen-generated ebooks ☆62 Updated 2 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt). ☆52 Updated last year
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896 ☆43 Updated 6 years ago
- Corpus of classical Chinese poetry ☆27 Updated 8 years ago
- A simple command-line utility for Linux, for extracting text from EPUB documents. ☆225 Updated 2 months ago
- Offline bilingual dictionaries made using data from Wiktionary ☆54 Updated 10 years ago
- The New York Times English-Chinese parallel corpus ☆16 Updated 3 years ago
- Utility scripts or libraries for various Natural Language Processing tasks. ☆39 Updated 3 years ago
- EpubSplit Calibre Plugin ☆101 Updated 4 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary ☆90 Updated this week
- Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF ☆43 Updated 11 months ago
- Top 100k word frequencies from BNC/ANC/COCA, in DSL format, for GoldenDict ☆60 Updated 8 years ago
- Adds word-frequency markers and annotations (dictionary definitions) to EPUB ebooks ☆15 Updated 6 years ago
- Download YouTube subtitles (closed captions, CC) as txt or JSON; supports translation and proxy. Available on pip 🐍. Try it online at goo… ☆70 Updated last year
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text. ☆24 Updated 4 years ago
- A Python module for word inflections designed for use with spaCy. ☆92 Updated 5 years ago
- Tokenizer, POS-tagger, and dependency parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages ☆50 Updated 3 weeks ago
- Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT ☆21 Updated 2 years ago
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge. ☆159 Updated 4 years ago
- Python module for reading StarDict dictionaries ☆45 Updated last year
- Extract and align grammar patterns from English sentences. ☆54 Updated 2 years ago
- The multilingual variant of GLM, a general language model trained with an autoregressive blank infilling objective ☆62 Updated 2 years ago
- Tool for converting Chinese characters to Wubi input codes ☆33 Updated 6 years ago
- Python module that identifies Chinese text as being Simplified or Traditional ☆91 Updated 5 months ago
- A simple website demonstrating TextRank's extractive summarization capability. ☆55 Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings ☆116 Updated 6 months ago
- Offline etymological dictionary based on Wiktionary data ☆21 Updated 3 years ago
- Almost automatically collect dictionary data from https://kotobank.jp/dictionary/. ☆18 Updated 6 years ago
- Tool for converting Sogou cell lexicons (细胞词库) to plain text. Extracts vocabulary lists for use in deep learning data generation and as dictionary features. ☆23 Updated 6 years ago
- Translation demonstrator ☆33 Updated 4 years ago