kevinxiong / epub2txtLinks
convert epub file to txt
☆94Updated 5 years ago
Alternatives and similar repositories for epub2txt
Users that are interested in epub2txt are comparing it to the libraries listed below
Sorting:
- python based software to unpack kindlegen generated ebooks☆71Updated 2 months ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆53Updated last year
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 7 years ago
- The New York Times English-Chinese parallel corpus☆17Updated 3 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆104Updated last year
- Offline bilingual dictionaries made using data from Wiktionary☆61Updated 10 years ago
- PyMultiDictionary is a dictionary module that gets meanings, translations, synonyms, and antonyms of words in 20 different languages☆54Updated 5 months ago
- download youtube subtitles(closed caption, cc) as txt or json, support translation and proxy. available on PIP 🐍 . try it online at goo…☆72Updated 2 years ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆94Updated this week
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆125Updated last year
- 中文古诗词语料库☆27Updated 9 years ago
- 🔥 专注于中文的「自然语言处理框架」:中文分词;平衡类别;数据集划分...☆12Updated 5 years ago
- A natural language date parser. (Python version of chrono.js)☆25Updated 5 months ago
- máobĭ (毛笔) is an Anki add-on to create cards with writing quizzes for Hanzi (Chinese characters)☆58Updated last year
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge.☆160Updated 4 years ago
- Extract and align grammar patterns from English sentences.☆56Updated 2 years ago
- free google translation api(免费google翻译api)☆26Updated 6 years ago
- Multilingual sentence alignment using sentence embeddings☆130Updated last year
- Hanzi Converter for Traditional and Simplified Chinese☆190Updated 5 years ago
- Extract dates from text☆65Updated 4 years ago
- Extract templated Open Information Extraction☆17Updated 8 years ago
- Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT☆23Updated 2 years ago
- 物种名称语料库。植物名,动物名。☆51Updated last year
- 🦜 NLP for Tibetan, in Python.☆37Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆56Updated 4 years ago
- Python wrapper for LanguageTool grammar checker☆329Updated 4 years ago
- maximum entropy based part-of-speech tagger for NLTK☆45Updated 8 years ago
- OKR: A Consolidated Open Knowledge Representation for Multiple Texts☆41Updated 7 years ago
- Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization☆11Updated 6 years ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 2 months ago