Docker container to provide Apache Tika RESTful API
☆41Feb 12, 2016Updated 10 years ago
Alternatives and similar repositories for tika-tesseract-docker
Users that are interested in tika-tesseract-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Make for data☆21Aug 17, 2018Updated 7 years ago
- Ideas for (tech) stuff to research, build or work on.☆49Jan 27, 2026Updated 5 months ago
- Simplifying the process of launching an open data repository. [RETIRED]☆20Jan 7, 2015Updated 11 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot)☆15Jul 8, 2015Updated 10 years ago
- Extract deleted tweet & politician data from the Politwoops project☆24May 14, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 10 years ago
- General information and docs about Crosscloud☆18Oct 30, 2014Updated 11 years ago
- online condolence book☆18Jan 4, 2016Updated 10 years ago
- CKAN Geospatial ResourceView☆48Jun 19, 2026Updated last week
- Transitioning the Web to HTTPS☆19Feb 25, 2022Updated 4 years ago
- Work relating to the OCR wish-list item "figure out an algorithm that would separate images into sets with no handwriting, little handwri…☆20Feb 22, 2013Updated 13 years ago
- BibJSON spec and website☆20Mar 19, 2015Updated 11 years ago
- a list of public cabals☆19Sep 27, 2024Updated last year
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Sep 11, 2015Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Uduvudu☆17Mar 2, 2020Updated 6 years ago
- Open Knowledge Labs website (and general issue tracker).☆83Feb 4, 2025Updated last year
- Create and manage needs on GOV.UK☆16Aug 7, 2025Updated 10 months ago
- CKAN Geospatial ResourceView☆16Sep 12, 2022Updated 3 years ago
- Offenes Ratsinformationssystem: Weboberfläche☆12Jan 4, 2017Updated 9 years ago
- CLI utility to spider websites and extract links to data files☆13Mar 18, 2015Updated 11 years ago
- Generate BigQuery tables, load and extract data, based on JSON Table Schema descriptors.☆18Jun 1, 2021Updated 5 years ago
- Corruption Perceptions Index - CPI☆18May 22, 2026Updated last month
- ☆48Feb 13, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Jan 12, 2016Updated 10 years ago
- A sample app that combines geolocated entities from Freebase with Maps API☆43Mar 20, 2014Updated 12 years ago
- Create and validate Data Packages in the browser☆27Dec 20, 2021Updated 4 years ago
- The Linked GTFS vocabulary☆39Mar 20, 2022Updated 4 years ago
- Open Knowledge standard "nice" jekyll theme - used for OpenDataHandbook etc☆16May 9, 2022Updated 4 years ago
- A spec for reporting errors in data quality.☆20May 25, 2021Updated 5 years ago
- Some code to examine and modify your experience of Twitter.☆11May 30, 2020Updated 6 years ago
- Utilities for converting between Prosemirror schemas and the Pandoc JSON format☆18Mar 4, 2023Updated 3 years ago
- Edit CSV files in the browser and sync them with GitHub☆19Jan 3, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Data Package Manager for R☆56Jul 14, 2017Updated 8 years ago
- ARCHIVED Read and Write Data Packages☆37May 10, 2022Updated 4 years ago
- Eco-living maps and data based on OpenStreetMap☆17Jul 12, 2013Updated 12 years ago
- This is a REST Server endpoint built using Flask and Python.☆24Nov 16, 2022Updated 3 years ago
- A minimal Akoma Ntoso -based legal informatics toolchain☆16Oct 25, 2023Updated 2 years ago
- GenderTracker is a service that decomposes articles and computes various gender-related metrics based on the content.☆25Jan 2, 2014Updated 12 years ago
- A souped-up CSV-based data format☆35Apr 19, 2013Updated 13 years ago