timminator / PaddleOCR-StandaloneLinks
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices). Now as a standalone executable!
☆24Updated this week
Alternatives and similar repositories for PaddleOCR-Standalone
Users that are interested in PaddleOCR-Standalone are comparing it to the libraries listed below
Sorting:
- Convert captured images to text using BaiduOCR, GoogleOCR, WindowsOCR, tesseractOCR, RapidOCR or Capture2Text, and translate the resultin…☆77Updated 10 months ago
- Enhancing Translation with RAG-Powered Large Language Models☆81Updated 2 weeks ago
- Polyglot is a fast, elegant, and free translation tool using AI.☆62Updated last year
- A bot that checks your grammar and phrasing using LLM of choice☆31Updated 6 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆259Updated 5 months ago
- Easy to use interface for the Whisper model optimized for all GPUs!☆281Updated last month
- AI Powered search tool offers content-based, text, and visual similarity system-wide search.☆263Updated 3 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated 5 months ago
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper☆84Updated 5 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆57Updated 9 months ago
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆61Updated last year
- ☆22Updated 7 months ago
- Context-aware LLM Translator (CALT)☆39Updated 7 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆112Updated 5 months ago
- ez audio transcription tool with flexible processing and post-processing options☆158Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆49Updated 7 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks…☆189Updated last month
- A minimal Android demo app for Kokoro-TTS☆29Updated 6 months ago
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆144Updated 2 weeks ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆56Updated last year
- Object Detection Model for Scanned Documents☆95Updated 5 months ago
- ☆48Updated 5 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 7 months ago
- web based editor for subtitles and transcripts☆140Updated last year
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆241Updated 7 months ago
- AI management tool☆119Updated 9 months ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆27Updated 3 months ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆157Updated 11 months ago
- My personal fork of koboldcpp where I hack in experimental samplers.☆47Updated last year
- A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas…☆22Updated 6 months ago