jenh / epub-ocr-and-translate
Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF
☆43 Updated 6 months ago
Related projects
Alternatives and complementary repositories for epub-ocr-and-translate
- Unofficial Anna's Archive API written in JS. ☆32 Updated last year
- Translate HTML using Argos Translate ☆49 Updated last year
- Convert epub file to txt ☆24 Updated last year
- A Bash script to search and download books using shadow libraries ☆43 Updated 3 months ago
- A Python library that provides an API to search and get links for books, magazines, comics, ... from Library Genesis. ☆117 Updated 2 years ago
- Download, convert and organize Gutenberg books for eBook Readers ☆46 Updated 5 years ago
- Scraper for downloading the entire ebooks repository of Project Gutenberg ☆135 Updated 2 weeks ago
- Local cross-platform machine translation GUI, based on CTranslate2 ☆88 Updated 10 months ago
- Automated behaviors that run in the browser to interact with complex sites. Used by ArchiveWeb.page and Browsertrix Crawler. ☆33 Updated last week
- Download borrowed books from archive.org ☆51 Updated last year
- 📈 A forced aligner intended for synchronization of narrated text ☆85 Updated last year
- Library Genesis (libgen) db dumps mirror on IPFS ☆43 Updated 4 months ago
- Readium Desktop is an SDK for ebooks, audiobooks and comics written in TypeScript and using Node.js and Electron.js. ☆61 Updated 2 years ago
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a… ☆33 Updated 3 years ago
- smoothscan is a tool to convert scanned text into a vectorized output form. ☆67 Updated 11 years ago
- ☆95 Updated 8 years ago
- EpubSplit Calibre Plugin ☆87 Updated 3 months ago
- Building scantailor and its dependencies ☆55 Updated last year
- Want a new ZIM file? Propose ZIM content improvements or fixes? Here you are! ☆42 Updated last month
- 📦 A collection of files for LibriVox recordings to produce ebooks with synchronized text and audio ☆24 Updated 4 years ago
- Web-based editor for subtitles and transcripts ☆112 Updated 3 months ago
- Google Books Downloader / Image Scraper ☆53 Updated 5 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆52 Updated 9 years ago
- Download books from a 📚 Goodreads shelf using ⛵ Library Genesis. ☆82 Updated 4 years ago
- Customizable machine translation in C++ ☆43 Updated 7 months ago
- k2pdfopt library for koreader, based on http://willus.com/k2pdfopt ☆93 Updated last week
- ez audio transcription tool with flexible processing and post-processing options ☆130 Updated 9 months ago
- A simple command-line utility for Linux, for extracting text from EPUB documents. ☆193 Updated last month
- Tero Subtitler is an open source, cross-platform, and free subtitle editing software. ☆259 Updated 3 months ago