kevinxiong / epub2txtLinks
convert epub file to txt
☆94Updated 5 years ago
Alternatives and similar repositories for epub2txt
Users that are interested in epub2txt are comparing it to the libraries listed below
Sorting:
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆53Updated last year
- python based software to unpack kindlegen generated ebooks☆72Updated 3 months ago
- download youtube subtitles(closed caption, cc) as txt or json, support translation and proxy. available on PIP 🐍 . try it online at goo…☆72Updated 2 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 7 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆19Updated 2 years ago
- A simple command-line utility for Linux, for extracting text from EPUB documents.☆247Updated last month
- extract data from html table☆88Updated 5 years ago
- Bilingual sengence aligner☆28Updated last month
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆94Updated last week
- The New York Times English-Chinese parallel corpus☆17Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings☆135Updated last year
- pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge.☆160Updated 4 years ago
- Extract and align grammar patterns from English sentences.☆56Updated 3 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆105Updated last year
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆54Updated 4 months ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆126Updated last year
- a utility to extract the title from a PDF file☆143Updated 10 months ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆62Updated 5 years ago
- PyMultiDictionary is a dictionary module that gets meanings, translations, synonyms, and antonyms of words in 20 different languages☆55Updated 7 months ago
- Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT☆23Updated 2 years ago
- Python library for downloading closed captions(subtitles) from Youtube☆61Updated 2 years ago
- A simple website demonstrating TextRank's extractive summarization capability.☆55Updated 4 years ago
- free google translation api(免费google翻译api)☆26Updated 6 years ago
- Chinese Characters Visualization & Chinese Text Augmentation.☆16Updated 3 years ago
- 💡✏️️ ⬇️️ JSON to Markdown converter - Generate Markdown from format independent JSON☆78Updated 6 years ago
- Text pattern search using marisa-trie☆18Updated 11 months ago
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆52Updated 7 years ago
- Measure the readability of a given text using surface characteristics☆81Updated 11 months ago
- Offline bilingual dictionaries made using data from Wiktionary☆62Updated 10 years ago