bytefer / macos-vision-ocr
A powerful command-line OCR tool built with Apple's Vision framework, supporting single image and batch processing with detailed positional information output.
β58Updated last week
Alternatives and similar repositories for macos-vision-ocr:
Users that are interested in macos-vision-ocr are comparing it to the libraries listed below
- Implementing OCR with a local visual model run by ollama.β236Updated 2 months ago
- π·οΈ DiscovAI Crawl API(π§ Work in Progress π§): A powerful web scraping solution for AI tools and vector databases. Extract clean HTML, gβ¦β18Updated 6 months ago
- Generate ideal question-answers for testing RAGβ126Updated 2 weeks ago
- Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workersβ253Updated 2 weeks ago
- High accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR β¦β61Updated this week
- MagicPush is the open-source push notification service for developersβ219Updated 6 months ago
- Flowchart-like UI to interconnect LLM's and Huggingface models, and deploy them as a REST API with little to no code.β63Updated this week
- A smrt, no, smart, ok, no dumb smartbar for Ollamaβ52Updated last year
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a ligβ¦β217Updated last month
- Use cloudflare worker and rust wasm to build an image processing service. δ½Ώη¨ Cloudflare Worker ε Rust WASM ζε»ΊεΎεε€ηζε‘β28Updated 4 months ago
- Chat with any website on your local machineβ72Updated 7 months ago
- β125Updated 3 months ago
- Parse PDFs into markdown using Vision LLMsβ273Updated 2 weeks ago
- Ollama desktop client for everyday useβ44Updated 4 months ago
- Lightweight Loom alternative to show your face on videosβ17Updated 4 months ago
- kokoro text to speech using javascriptβ52Updated 3 weeks ago
- Extremely memory-efficient vector databaseβ63Updated 5 months ago
- π¦ echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compβ¦β90Updated 3 months ago
- π₯ LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysisβ95Updated 2 months ago
- chromeζδ»Ά + electron + node + react => ι»θΎζ΅η¨εΎη½ι‘΅θͺε¨εε·₯ε ·β36Updated last month
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIPβ345Updated this week
- Detect whether or not an audio file was generated by NotebookLMβ131Updated 2 months ago
- An extensive desktop app for ChatGPT and other LLMs.β175Updated last year
- Blazing fast whisper turbo for ASR (speech-to-text) tasksβ192Updated 4 months ago