kevinxiong / epub2txt
convert epub file to txt
☆85Updated 4 years ago
Alternatives and similar repositories for epub2txt:
Users that are interested in epub2txt are comparing it to the libraries listed below
- python based software to unpack kindlegen generated ebooks☆61Updated 2 years ago
- A simple command-line utility for Linux, for extracting text from EPUB documents.☆208Updated 2 months ago
- Converts between traditional and simplified Chinese☆30Updated 5 months ago
- Multilingual sentence alignment using sentence embeddings☆108Updated 3 months ago
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- 中文古诗词语料库☆22Updated 8 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆238Updated 2 years ago
- Scrape glosbe dicts☆9Updated 2 years ago
- colordict词典库☆84Updated 10 years ago
- Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT☆21Updated last year
- Extract and align grammar patterns from English sentences.☆54Updated 2 years ago
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge.☆158Updated 3 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated this week
- Bitextor generates translation memories from multilingual websites☆293Updated 3 months ago
- Python module that identifies Chinese text as being Simplified or Traditional☆89Updated 3 months ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆57Updated 5 months ago
- ☆15Updated last year
- download youtube subtitles(closed caption, cc) as txt or json, support translation and proxy. available on PIP 🐍 . try it online at goo…☆70Updated last year
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆65Updated 6 years ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Seed Machine Translation Data☆30Updated 3 months ago
- The New York Times English-Chinese parallel corpus☆16Updated 3 years ago
- Sentence aligner☆109Updated 3 years ago
- Measure the readability of a given text using surface characteristics☆77Updated 3 weeks ago
- Convert epub file to txt☆31Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆89Updated this week
- Fifteen Thousand Useful Phrases, by Greenville Kleiser☆54Updated 8 years ago
- Faster, modernized fork of the language identification tool langid.py☆53Updated 2 months ago