A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit.
☆47Jan 26, 2026Updated 2 months ago
Alternatives and similar repositories for exaOCR
Users that are interested in exaOCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pdfLLM is a completely open source, proof of concept RAG app.☆186Sep 1, 2025Updated 7 months ago
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.☆26Aug 18, 2025Updated 7 months ago
- TreeThinkerAgent is a lightweight orchestration layer that turns any LLM into an autonomous multi-step reasoning agent. It supports multi…☆21Feb 11, 2026Updated 2 months ago
- One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs, RAG pipelines, and beyond.☆66Apr 8, 2026Updated last week
- Template CrewAI allowing for selection of multiple agents including GPT-3, GPT-4, Mixtral, Llama 3, and Gemma☆11May 11, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web …☆151Apr 4, 2026Updated last week
- Simile combines the power of AI embeddings with fuzzy string matching and keyword search to deliver highly relevant search results—all ru…☆30Dec 28, 2025Updated 3 months ago
- An easy-to-use library and command-line tool for TTS☆15May 3, 2025Updated 11 months ago
- Elixir library to generate Ecto migrations from a PostgreSQL schema SQL file. Uses NimbleParsec and macro-style code generation.☆18Dec 12, 2025Updated 4 months ago
- An MCP server for JupyterCAD that allows you to control it using LLMs/natural language.☆17Oct 7, 2025Updated 6 months ago
- js client for R2R: production-ready RAG engine with a sh*t ton of features.☆12Aug 30, 2024Updated last year
- ☆13Oct 20, 2025Updated 5 months ago
- A Python-native Terminal-Based Git Client - Navigate and manage your Git repositories with a beautiful TUI interface inspired by LazyGit.☆34Feb 7, 2026Updated 2 months ago
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Paperless-ngx consume script that leverages Docling to provide superior OCR and layout analysis for PDFs, Office documents, and images.☆15Dec 7, 2025Updated 4 months ago
- Claude code slash commands creation for session management☆14Sep 13, 2025Updated 7 months ago
- A crate to list WiFi hotspots in your area. Fork of booyaa/wifiscanner.☆40Jan 6, 2026Updated 3 months ago
- Stop using static chunk sizes. A lightweight, production-ready RAG ingestion toolkit. Uses Docling for layout-aware parsing and applies s…☆66Mar 15, 2026Updated 3 weeks ago
- ☆27Jun 11, 2025Updated 10 months ago
- ☆23Oct 28, 2025Updated 5 months ago
- ☆11Apr 7, 2026Updated last week
- Professional RAG development skills for Claude Code - audit, evaluate, optimize, and scaffold RAG pipelines☆30Jan 18, 2026Updated 2 months ago
- Middleware for AI Agents that verifies grounding and prevents hallucinations. Returns structured retry suggestions for self-correction.☆51Dec 11, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A high-performance, distributed memory management system for LLM agents built with LangGraph, LangChain, Ray, and vLLM. Features multi-la…☆11Apr 23, 2025Updated 11 months ago
- Self-Extensible Multi Agent Assistant 🐋☆54Feb 13, 2026Updated 2 months ago
- KGet is a modern, lightweight download manager written in Rust for fast and reliable file downloads from the command line and native app …☆36Mar 2, 2026Updated last month
- [H] HyperspaceDB is a high-performance, hyperbolic vector database written in Rust. It features 1-bit quantization, async replication, an…☆78Mar 26, 2026Updated 2 weeks ago
- Perplexity style AI answer engine for AI PCs with CPU,GPU and NPU support☆48Mar 1, 2026Updated last month
- ☆27Jun 22, 2025Updated 9 months ago
- A modern Python GUI application to batch extract YouTube comments and metadata to CSV. Features spam filtering, relevance sorting, and a …☆29Jan 5, 2026Updated 3 months ago
- ☆49Feb 27, 2026Updated last month
- ☆39Nov 17, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Squash adds an invisible memory layer to your browser, compressing every click into portable context for any AI agent☆31Sep 22, 2025Updated 6 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- A scalable solution that simplifies the integration of ComfyUI for developers☆11Jul 15, 2024Updated last year
- FastMLX is a high performance production ready API to host MLX models.☆25Nov 18, 2024Updated last year
- ☆41Nov 3, 2025Updated 5 months ago
- A comprehensive toolkit that provides building blocks for LLM-based named entity recognition, attribute extraction, and relation extracti…☆55Apr 6, 2026Updated last week
- Pluggable sample-level metadata versioning for incremental multimodal pipelines.☆87Updated this week