timminator / PaddleOCR-Standalone
Awesome multilingual OCR toolkit based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded and IoT devices). Now as a standalone executable!
☆45 Updated 4 months ago
Alternatives and similar repositories for PaddleOCR-Standalone
Users who are interested in PaddleOCR-Standalone are comparing it to the libraries listed below:
- Batch speech to text using OpenAI's whisper. ☆304 Updated 9 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆162 Updated this week
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper ☆96 Updated 9 months ago
- An easy-to-use GUI addon for whisper-standalone-win. Designed for those who prefer a simple interface over typing commands and file paths… ☆13 Updated 2 years ago
- Meta's "No Language Left Behind" models served as web app and REST API ☆250 Updated 7 months ago
- Python library to use Google Lens OCR for free, via the API used in Chromium. ☆45 Updated 3 months ago
- A screen translator/OCR translator made with Python and Tesseract; the user interface is made with Tkinter. All code written in pyt… ☆137 Updated last year
- Efficient translation tool based on ChatGPT or any OpenAI compatible LLM chat completion API ☆359 Updated 3 weeks ago
- Transcribe and translate voice into an LRC file using Whisper and LLMs (GPT, Claude, et al.). ☆618 Updated 4 months ago
- Easy to use interface for the Whisper model optimized for all GPUs! ☆405 Updated 4 months ago
- 🎦 Extract video hard subtitles and automatically generate corresponding srt files. ☆468 Updated 3 months ago
- Context-aware LLM Translator (CALT) ☆45 Updated 11 months ago
- Convert captured images to text using BaiduOCR, GoogleOCR, WindowsOCR, tesseractOCR, RapidOCR or Capture2Text, and translate the resultin… ☆85 Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆132 Updated this week
- A Python package for unlimited DeepL translation ☆205 Updated 9 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆267 Updated 9 months ago
- Manga&Comic text detection ☆305 Updated 2 years ago
- A UI for the Piper TTS