timminator / PaddleOCR-Standalone
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices). Now as a standalone executable!
☆31 Updated last month
Alternatives and similar repositories for PaddleOCR-Standalone
Users who are interested in PaddleOCR-Standalone are comparing it to the repositories listed below
- Enhancing Translation with RAG-Powered Large Language Models ☆83 Updated 2 weeks ago
- Batch speech to text using OpenAI's whisper. ☆299 Updated 6 months ago
- Synchronize SRT timestamps over an existing accurate transcription ☆35 Updated 11 months ago
- Convert captured images to text using BaiduOCR, GoogleOCR, WindowsOCR, tesseractOCR, RapidOCR or Capture2Text, and translate the resultin… ☆81 Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆259 Updated 7 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆159 Updated last year
- A simple no-install web UI for Ollama and OAI-Compatible APIs! ☆31 Updated 8 months ago
- Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complian… ☆66 Updated 6 months ago
- AI Powered search tool offers content-based, text, and visual similarity system-wide search. ☆266 Updated 4 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆126 Updated 6 months ago
- Context-aware LLM Translator (CALT) ☆43 Updated 9 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆25 Updated 6 months ago
- Easy to use interface for the Whisper model optimized for all GPUs! ☆320 Updated 2 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI ☆57 Updated 10 months ago
- Meta's "No Language Left Behind" models served as web app and REST API ☆240 Updated 4 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks… ☆203 Updated last month
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆152 Updated 3 weeks ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆126 Updated this week
- Chatbot-to-speech using Orpheus TTS model. Interactive console app. ☆20 Updated 5 months ago
- A bot that checks your grammar and phrasing using LLM of choice ☆32 Updated 8 months ago
- Generate high quality Japanese audio for your Anki cards using the VOICEVOX speech synthesis software ☆35 Updated 2 weeks ago
- ☆42 Updated 8 months ago
- My personal fork of koboldcpp where I hack in experimental samplers. ☆46 Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆57 Updated last year
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs. ☆79 Updated last week
- 🎦 Extract video hard subtitles and automatically generate corresponding srt files. ☆446 Updated last month
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper ☆87 Updated 6 months ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for. ☆28 Updated 7 months ago
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip… ☆63 Updated last year
- Simple TTS using MS Edge built-in voices ☆28 Updated 3 years ago