mmiklavc / scalable-ocrView external linksLinks
Scalable Optical Character Recognition with Apache NiFi and Tesseract
☆33Aug 11, 2016Updated 9 years ago
Alternatives and similar repositories for scalable-ocr
Users that are interested in scalable-ocr are comparing it to the libraries listed below
Sorting:
- Apache NiFi Custom Processor Extracting Text From Files with Apache Tika☆34Aug 24, 2023Updated 2 years ago
- ☆19Sep 8, 2017Updated 8 years ago
- Apache NiFi NLP Processor☆18Nov 8, 2023Updated 2 years ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Oct 27, 2021Updated 4 years ago
- Open-source boilerplate for computer vision research☆16Aug 4, 2020Updated 5 years ago
- Advanced desktop search/corpus exploration prototype☆21Jun 23, 2021Updated 4 years ago
- In-browser OCR of Ancient Greek and Latin☆26Jan 6, 2026Updated last month
- A PHP library for comparing two or more Sanskrit TEI XML files and generating an apparatus with variants☆14Aug 18, 2025Updated 5 months ago
- GRASS GIS module for wildfire simulation wrapping r.ros and r.spread modules☆11Dec 13, 2021Updated 4 years ago
- ☆38Aug 27, 2016Updated 9 years ago
- TEI-encoded contents of the Egyptian Gazette☆15Jun 11, 2024Updated last year
- TiO is an AirBnB like android app demo developed from a hackathon. I developed it with another Android developer, a backend, and a UI des…☆11May 30, 2016Updated 9 years ago
- Scalable genomic analysis pipelines, written in WDL☆11Updated this week
- Oracc GUI☆12Jun 27, 2025Updated 7 months ago
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Dec 13, 2024Updated last year
- Vector extraction of data from scientific publications☆10Apr 26, 2023Updated 2 years ago
- Open Source Visual Acuity Testing for Optometrists☆11Oct 22, 2025Updated 3 months ago
- Open Source Computer Vision with TensorFlow, MiniFi, Apache NiFi, OpenCV, Apache Tika and Python For processing images from IoT devices…☆45Jun 16, 2018Updated 7 years ago
- Azure-Sentinel-BYOML☆12Nov 8, 2019Updated 6 years ago
- A TensorFlow 2.0 .whl file compiled with an old processor/computer☆11Dec 12, 2020Updated 5 years ago
- Pytorch implementation of Nueral Style transfer☆10Jun 22, 2021Updated 4 years ago
- Artifical-intelligence☆10Dec 27, 2022Updated 3 years ago
- Project to digitize avant-garde periodicals☆12May 13, 2022Updated 3 years ago
- A single source of truth for data definitions☆11Dec 10, 2022Updated 3 years ago
- Simple videoconferencing service created using Twilio's Programmable Video Group Rooms API☆10May 24, 2018Updated 7 years ago
- A template repo for squids indexing Ethereum mainnet☆12Oct 30, 2024Updated last year
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- Sharepoint Java API (Restful)☆14Oct 17, 2017Updated 8 years ago
- Online voter registration (OVR) application-as-a-service for 3rd party registrar organizations.☆11Updated this week
- ☆13Aug 11, 2025Updated 6 months ago
- ☆16Jul 19, 2024Updated last year
- A Simple Sudoku Solver☆23Nov 26, 2012Updated 13 years ago
- Additional convenience processors not found in core Apache NiFi☆97Apr 12, 2022Updated 3 years ago
- Surfacing Semantic Data from Clinical Notes in Electronic Health Records for Tailored Care, Trial Recruitment and Clinical Research☆88Jan 13, 2023Updated 3 years ago
- OpenNCC Frame☆12Oct 21, 2022Updated 3 years ago
- rnsh is a command-line utility written in Python that facilitates shell sessions over Reticulum networks and aims to provide a similar ex…☆12Jan 6, 2026Updated last month
- MDLText☆12Jul 13, 2017Updated 8 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- A prototype demostrating querying, filtering and visualizing data from remote geoparquet files in the browser through duckdb-wasm and dec…☆19Dec 15, 2024Updated last year