kevinxiong / epub2txtLinks
convert epub file to txt
☆88Updated 5 years ago
Alternatives and similar repositories for epub2txt
Users that are interested in epub2txt are comparing it to the libraries listed below
Sorting:
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆52Updated last year
- A simple command-line utility for Linux, for extracting text from EPUB documents.☆230Updated 3 weeks ago
- A tiny script to convert your mdx dictionary file to CSV☆11Updated 6 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- The New York Times English-Chinese parallel corpus☆16Updated 3 years ago
- python based software to unpack kindlegen generated ebooks☆64Updated 2 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆55Updated 10 years ago
- 中文古诗词语料库☆27Updated 8 years ago
- Extract and align grammar patterns from English sentences.☆55Updated 2 years ago
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆66Updated 7 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆66Updated 2 weeks ago
- Convert epub file to txt☆35Updated 2 years ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆91Updated this week
- classic Chinese punctuate experiment with keras using daizhige(殆知阁古代文献藏书) dataset☆35Updated 2 years ago
- download youtube subtitles(closed caption, cc) as txt or json, support translation and proxy. available on PIP 🐍 . try it online at goo…☆70Updated last year
- Multilingual sentence alignment using sentence embeddings☆120Updated 7 months ago
- Collaborative on-line editor for aligned parallel texts.☆13Updated 3 years ago
- ☆16Updated last year
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 3 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆95Updated 7 months ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 7 years ago
- bilingual dictionary extractor from parallel corpora☆22Updated 10 years ago
- The zhong [|] Chinese grammars☆14Updated last month
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆51Updated 2 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆248Updated 2 years ago
- colordict词典库☆87Updated 11 years ago
- Faster, modernized fork of the language identification tool langid.py☆56Updated 7 months ago
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆21Updated last year
- Translation demonstrator☆34Updated 5 years ago
- Text pattern search using marisa-trie☆18Updated 4 months ago