kevinxiong / epub2txt
convert epub file to txt
☆85Updated 4 years ago
Alternatives and similar repositories for epub2txt:
Users that are interested in epub2txt are comparing it to the libraries listed below
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- python based software to unpack kindlegen generated ebooks☆62Updated 2 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆52Updated last year
- Convert epub file to txt☆31Updated last year
- 中文古诗词语料库☆26Updated 8 years ago
- A simple command-line utility for Linux, for extracting text from EPUB documents.☆221Updated last month
- Stand-alone WordNet API☆48Updated 3 years ago
- python bindings of cppjieba ,recommand jieba_fast for results consistency and speed balance☆21Updated 5 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆91Updated 4 months ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- Faster, modernized fork of the language identification tool langid.py☆55Updated 4 months ago
- 汉字组件笔画数据☆14Updated 6 years ago
- Converts between traditional and simplified Chinese☆30Updated 7 months ago
- maximum entropy based part-of-speech tagger for NLTK☆45Updated 8 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- 《现代汉语大词典》字词头☆26Updated 4 years ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 6 years ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆50Updated 6 years ago
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge.☆159Updated 4 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆112Updated 2 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆89Updated last week
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 3 years ago
- ☆35Updated 10 months ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆23Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated this week
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation☆58Updated 7 months ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆50Updated 2 weeks ago
- 漢語拼音轉換表☆39Updated 4 years ago
- 汉字五笔转换工具☆33Updated 6 years ago