Docker container to provide Apache Tika RESTful API
☆41Feb 12, 2016Updated 10 years ago
Alternatives and similar repositories for tika-tesseract-docker
Users that are interested in tika-tesseract-docker are comparing it to the libraries listed below
Sorting:
- Make for data☆21Aug 17, 2018Updated 7 years ago
- Ideas for (tech) stuff to research, build or work on.☆49Jan 27, 2026Updated last month
- Simplifying the process of launching an open data repository. [RETIRED]☆20Jan 7, 2015Updated 11 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot)☆15Jul 8, 2015Updated 10 years ago
- Extract deleted tweet & politician data from the Politwoops project☆24May 14, 2017Updated 8 years ago
- A Javascript-only data library providing functionality like DataFrame in Pandas or R. (Currently in research phase - does this already ex…☆13Aug 4, 2017Updated 8 years ago
- Open database of scholarly journals☆10Oct 26, 2022Updated 3 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 9 years ago
- General information and docs about Crosscloud☆18Oct 30, 2014Updated 11 years ago
- online condolence book☆16Jan 4, 2016Updated 10 years ago
- CKAN Geospatial ResourceView☆48Jan 8, 2026Updated 2 months ago
- Transitioning the Web to HTTPS☆19Feb 25, 2022Updated 4 years ago
- Work relating to the OCR wish-list item "figure out an algorithm that would separate images into sets with no handwriting, little handwri…☆20Feb 22, 2013Updated 13 years ago
- Display CKAN resource views on dataset and home pages☆10Aug 8, 2017Updated 8 years ago
- BibJSON spec and website☆20Mar 19, 2015Updated 11 years ago
- a list of public cabals☆19Sep 27, 2024Updated last year
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Sep 11, 2015Updated 10 years ago
- FOAF specification☆46Dec 11, 2025Updated 3 months ago
- Create and manage needs on GOV.UK☆16Aug 7, 2025Updated 7 months ago
- Offenes Ratsinformationssystem: Weboberfläche☆12Jan 4, 2017Updated 9 years ago
- CLI utility to spider websites and extract links to data files☆13Mar 18, 2015Updated 11 years ago
- Open source and open knowledge (data and content) licenses together with API and web service.☆71Jul 3, 2024Updated last year
- Corruption Perceptions Index - CPI☆18Oct 25, 2024Updated last year
- ☆48Feb 13, 2024Updated 2 years ago
- ☆12Jan 12, 2016Updated 10 years ago
- A static documentation generator for Swagger APIs☆15Feb 3, 2015Updated 11 years ago
- A sample app that combines geolocated entities from Freebase with Maps API☆43Mar 20, 2014Updated 11 years ago
- Make templates and then make documents from templates: https://www.youtube.com/watch?v=sKhsy0e0lqk☆11Apr 8, 2015Updated 10 years ago
- Create and validate Data Packages in the browser☆27Dec 20, 2021Updated 4 years ago
- The Linked GTFS vocabulary☆39Mar 20, 2022Updated 3 years ago
- Open Knowledge standard "nice" jekyll theme - used for OpenDataHandbook etc☆16May 9, 2022Updated 3 years ago
- A spec for reporting errors in data quality.☆20May 25, 2021Updated 4 years ago
- Some code to examine and modify your experience of Twitter.☆11May 30, 2020Updated 5 years ago
- Utilities for converting between Prosemirror schemas and the Pandoc JSON format☆18Mar 4, 2023Updated 3 years ago
- Edit CSV files in the browser and sync them with GitHub☆19Jan 3, 2023Updated 3 years ago
- Data Package Manager for R☆57Jul 14, 2017Updated 8 years ago
- ARCHIVED Read and Write Data Packages☆37May 10, 2022Updated 3 years ago
- metadata indexing and searching of video containers☆17Dec 28, 2016Updated 9 years ago
- Eco-living maps and data based on OpenStreetMap☆17Jul 12, 2013Updated 12 years ago