straussmaximilian / ocrmacLinks
A python wrapper to extract text from images on a mac system. Uses the vision framework from Apple.
☆417Updated 5 months ago
Alternatives and similar repositories for ocrmac
Users that are interested in ocrmac are comparing it to the libraries listed below
Sorting:
- A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.☆339Updated 8 months ago
- Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.☆112Updated last year
- A command-line application to convert images, PDFs, and audio files to text using Apple's APIs☆740Updated 2 years ago
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆228Updated 5 months ago
- A powerful command-line OCR tool built with Apple's Vision framework, supporting single image and batch processing with detailed position…☆134Updated 6 months ago
- An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.☆436Updated this week
- Python bindings to PDFium, reasonably cross-platform.☆617Updated last week
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆769Updated last year
- Extract structured text from pdfs quickly☆585Updated 2 months ago
- The Python <-> Objective-C Bridge with bindings for macOS frameworks☆699Updated 2 weeks ago
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.☆583Updated 2 months ago
- Command line interface for the built-in speech recognition and transcription capabilities in macOS.☆515Updated 3 months ago
- Lightweight, performant, deep table extraction☆503Updated 3 weeks ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆225Updated 3 weeks ago
- System prompts from Apple's new Apple Intelligence on MacOS Sequoia☆197Updated 7 months ago
- 🗣️ A CLI for on-device speech transcription using Speech.framework on macOS 26☆1,085Updated 2 months ago
- HTML to Markdown converter and crawler.☆587Updated last year
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆292Updated 3 months ago
- Dictation app based on the OpenAI speech-to-text models☆199Updated last year
- Use Ollama to talk to local LLMs in Apple Notes☆689Updated 3 weeks ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆213Updated 10 months ago
- Convert HTML to Markdown☆1,756Updated 2 weeks ago
- macOS OCR CLI for https://developer.apple.com/documentation/vision/vnrecognizetextrequest☆46Updated last year
- 🩹Pure CDP but type-safe in Python☆147Updated 2 weeks ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆193Updated last week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆531Updated last week
- Simple MacOS StatusBar / Menu Bar app to automatically detect text in screenshots☆195Updated last year
- FastMLX is a high performance production ready API to host MLX models.☆325Updated 5 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆1,598Updated last week
- LM Studio Apple MLX engine☆752Updated last week