jenh / epub-ocr-and-translateLinks
Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF
☆43Updated last year
Alternatives and similar repositories for epub-ocr-and-translate
Users that are interested in epub-ocr-and-translate are comparing it to the libraries listed below
Sorting:
- EpubSplit Calibre Plugin☆105Updated last week
- Building scantailor and its dependencies☆58Updated last year
- Power Search: A full-text search plugin for Calibre☆38Updated last year
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 3 years ago
- Recoll Full Text Search Plugin for Calibre☆25Updated 4 years ago
- ☆27Updated 2 years ago
- A tiny script to convert your mdx dictionary file to CSV☆11Updated 6 years ago
- Clip web page content to Obsidian as Markdown☆13Updated 2 years ago
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆11Updated 5 years ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆149Updated 3 weeks ago
- Server backend and CLI toolkit for WebScrapBook browser extension.☆86Updated 2 weeks ago
- A mini Anki web server based on Flask, works with anki-sync-server.☆36Updated 2 years ago
- Latin language dictionaries☆36Updated 4 years ago
- Automatic de-keystoning for single camera DIY book scanners☆22Updated 9 years ago
- Hypertext-infused personal research productivity/database software (Mac/Win/Linux)☆153Updated this week
- Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs☆15Updated 8 years ago
- Translate HTML using Argos Translate☆50Updated last year
- PDF to DjVu converter☆100Updated last year
- Library Genesis (libgen) db dumps mirror on ipfs☆48Updated 11 months ago
- OCR for DjVu☆48Updated 2 years ago
- [CLI] count the words in an epub file☆33Updated 4 months ago
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆188Updated 2 weeks ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated 8 months ago
- 🍣 A set of tools to enhance GoldenDict.☆43Updated 3 months ago
- Structured data for classical studies☆19Updated 8 years ago
- Thoughts Memo 小站☆17Updated 2 years ago
- Open source projects for the Boox ebook reader.☆81Updated 11 years ago
- Audio Book scrapper☆26Updated last year
- TagSpaces Web Clipper for Chrome and Firefox☆44Updated last month