Apache Tika Server as a Docker Image
☆172 · Jul 17, 2022 · Updated 3 years ago
Alternatives and similar repositories for docker-tikaserver
Users that are interested in docker-tikaserver are comparing it to the libraries listed below.
- Efficient indexing and retrieval of OCR bounding boxes in Solr ☆22 · Mar 13, 2019 · Updated 7 years ago
- Docker container to provide Apache Tika RESTful API ☆41 · Feb 12, 2016 · Updated 10 years ago
- A Python wrapper for the nascent hypothes.is web API ☆11 · Jan 28, 2026 · Updated 4 months ago
- Django SKOS-XL Thesaurus manager ☆13 · Oct 18, 2021 · Updated 4 years ago
- Ideas for (tech) stuff to research, build or work on. ☆49 · Jan 27, 2026 · Updated 4 months ago
- LDIF - Linked Data Integration Framework ☆37 · Aug 2, 2016 · Updated 9 years ago
- Solr client and user interface for search ☆22 · Apr 25, 2024 · Updated 2 years ago
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ☆1,657 · Jun 10, 2026 · Updated last week
- Extract Data from Wikipedia Lists ☆31 · Aug 27, 2017 · Updated 8 years ago
- A project aiming "to significantly advance the state of the art with regard to indexing and querying biomedical data with freely availabl… ☆80 · Feb 17, 2026 · Updated 4 months ago
- Extract Data from Wikipedia Tables ☆34 · Aug 26, 2017 · Updated 8 years ago
- SKOS analysis for Elasticsearch ☆54 · Jun 15, 2016 · Updated 10 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆46 · Jan 16, 2022 · Updated 4 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität … ☆70 · Feb 13, 2019 · Updated 7 years ago
- EEA ElasticSearch RDF River Plugin ☆65 · Dec 14, 2021 · Updated 4 years ago
- This is a REST Server endpoint built using Flask and Python. ☆24 · Nov 16, 2022 · Updated 3 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This … ☆16 · Jun 9, 2016 · Updated 10 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆100 · Oct 9, 2022 · Updated 3 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess… ☆17 · Sep 11, 2015 · Updated 10 years ago
- This project has been archived and is no longer being developed or supported. The Curator's Workbench is an extensible digital collectio… ☆24 · Jun 25, 2020 · Updated 5 years ago
- Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate… ☆20 · Sep 16, 2014 · Updated 11 years ago
- Simplified version of a common crawl fetcher ☆16 · Dec 24, 2025 · Updated 5 months ago
- Core libraries by the PRImA Research Lab ☆16 · Jul 30, 2024 · Updated last year
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N… ☆281 · Oct 9, 2022 · Updated 3 years ago
- PHP client library for communicating with GetEventStore. ☆12 · Mar 7, 2016 · Updated 10 years ago
- CKAN Geospatial ResourceView ☆16 · Sep 12, 2022 · Updated 3 years ago
- SwiftHLM - a middleware for using OpenStack Swift with tape and other high latency media storage backends ☆14 · May 18, 2018 · Updated 8 years ago
- The nginx module to invalidate complete cache zone ☆11 · Jul 1, 2020 · Updated 5 years ago
- Java Wiktionary Library ☆61 · Nov 19, 2022 · Updated 3 years ago
- ☆11 · Aug 8, 2019 · Updated 6 years ago
- UI Components for Solr ☆11 · Apr 24, 2018 · Updated 8 years ago
- Java client for test.ai classifier server ☆12 · Dec 19, 2019 · Updated 6 years ago
- In the Django Authentication package is that all users use the same model/profile. This can be a drawback if you have lots of users or yo… ☆25 · Feb 6, 2016 · Updated 10 years ago
- Text mining on the Royal Library newspaper corpus ☆11 · Dec 3, 2025 · Updated 6 months ago
- Add editing UI and other power-user features to Datasette. ☆14 · Mar 4, 2023 · Updated 3 years ago
- A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for image… ☆21 · Jun 18, 2024 · Updated 2 years ago
- Python library for working with IIIF Image and Presentation APIs ☆20 · Updated this week
- Command-line tool to extract taxonomies from Wikidata ☆132 · Jun 19, 2019 · Updated 7 years ago
- Express middleware for querying our graphql server built with graph.ql ☆13 · Apr 29, 2018 · Updated 8 years ago