kevinxiong / epub2txt
convert epub file to txt
☆83Updated 4 years ago
Related projects
Alternatives and complementary repositories for epub2txt
- python based software to unpack kindlegen generated ebooks☆61Updated last year
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆55Updated 8 months ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 5 years ago
- A simple command-line utility for Linux, for extracting text from EPUB documents.☆193Updated last month
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆94Updated this week
- Converts between traditional and simplified Chinese☆30Updated 2 months ago
- Multilingual sentence alignment using sentence embeddings☆101Updated 2 weeks ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆82Updated this week
- Library for manipulating StarDict dictionaries from within Python☆104Updated last year
- Tools for extracting data from Apple dictionary files (used by the Dictionary application on Mac).☆113Updated last year
- Python module that identifies Chinese text as being Simplified or Traditional☆86Updated this week
- Break long English sentences into simple sentences☆12Updated last year
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆34Updated last month
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 9 years ago
- A database of number names for 186 languages, locales, and scripts☆66Updated last year
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge.☆158Updated 3 years ago
- Identifying complex sentences (with more than 2 clauses), detecting clause breakpoints and converting them to simpler sentences.☆16Updated 4 years ago
- words frequency top100k from BNC/ANC/COCA, dsl format, for goldendict☆62Updated 7 years ago
- Machine Translation Web Interface for OpenNMT-py☆25Updated 2 years ago
- python module reading the StarDict dictionaries☆44Updated last year
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆22Updated 4 years ago
- Export UNIHAN's database to csv, json or yaml☆52Updated this week
- Tool to fix bitexts and tag near-duplicates for removal☆29Updated 3 months ago
- ☆67Updated 3 months ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models for Japanese and other languages☆48Updated last month
- máobĭ (毛笔) is an Anki add-on to create cards with writing quizzes for Hanzi (Chinese characters)☆51Updated 3 weeks ago
- Adds word-frequency markers and annotations (dictionary definitions) to EPUB e-books☆15Updated 6 years ago
- Language model powered proof reader for correcting contextual errors in natural language.☆24Updated last year
- ☆91Updated last week