kevinxiong / epub2txt
Convert EPUB files to txt
☆94 · Updated 5 years ago
Alternatives and similar repositories for epub2txt
Users that are interested in epub2txt are comparing it to the libraries listed below
- Python-based software to unpack kindlegen-generated ebooks ☆74 · Updated 4 months ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt). ☆53 · Updated last year
- Download YouTube subtitles (closed captions, CC) as txt or json; supports translation and proxy. Available on PIP 🐍. Try it online at goo… ☆72 · Updated 2 years ago
- The New York Times English-Chinese parallel corpus ☆17 · Updated 4 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896 ☆43 · Updated 7 years ago
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge. ☆160 · Updated 4 years ago
- A simple command-line utility for Linux, for extracting text from EPUB documents. ☆249 · Updated 2 months ago
- PyMultiDictionary is a dictionary module that gets meanings, translations, synonyms, and antonyms of words in 20 different languages ☆56 · Updated 7 months ago
- Python code based on the Scrapy package to crawl famous online dictionaries like Oxford, Longman, Cambridge, Webster, and Collins t… ☆109 · Updated 2 years ago
- Python module that identifies Chinese text as being Simplified or Traditional ☆105 · Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆256 · Updated 3 years ago
- Multilingual sentence alignment using sentence embeddings ☆139 · Updated last year
- Chinese Characters Visualization & Chinese Text Augmentation. ☆16 · Updated 3 years ago
- Python parser for SubRip (srt) files ☆484 · Updated 2 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆62 · Updated 10 years ago
- Luoke vocabulary notebook (洛克生词本) ☆26 · Updated 5 years ago
- ☆81 · Updated 2 weeks ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions. ☆127 · Updated last year
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages ☆54 · Updated last week
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary ☆94 · Updated last week
- Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF ☆44 · Updated last year
- A Python module for word inflections designed for use with spaCy. ☆93 · Updated 5 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc. ☆19 · Updated 2 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of … ☆62 · Updated 5 years ago
- Generate subtitle files with timelines in an automatic way. ☆62 · Updated 3 years ago
- Extract and align grammar patterns from English sentences. ☆56 · Updated 3 years ago
- A tiny script to convert your mdx dictionary file to CSV ☆11 · Updated 7 years ago
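- Chinese character dataset, including related information for each character such as stroke count, radical, pinyin, and English definitions/synonyms. ☆126 · Updated 5 years ago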
- ☆23 · Updated 2 years ago
- Species name corpus: plant names and animal names. ☆51 · Updated last year