Apache Tika Server as a Docker Image
☆173Jul 17, 2022Updated 3 years ago
Alternatives and similar repositories for docker-tikaserver
Users that are interested in docker-tikaserver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Mar 13, 2019Updated 7 years ago
- Docker container to provide Apache Tika RESTful API☆41Feb 12, 2016Updated 10 years ago
- Ideas for (tech) stuff to research, build or work on.☆49Jan 27, 2026Updated 2 months ago
- Convenience Docker images for Apache Tika Server☆239Apr 13, 2026Updated last week
- A DropWizard wrapper around Apache Tika.☆10Dec 22, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,651Mar 28, 2026Updated 3 weeks ago
- SKOS Support for Apache Lucene and Solr☆56May 12, 2021Updated 4 years ago
- Semantic faceted search using SPARQL☆19May 18, 2018Updated 7 years ago
- Extract Data from Wikipedia Lists☆31Aug 27, 2017Updated 8 years ago
- Extract Data from Wikipedia Tables☆34Aug 26, 2017Updated 8 years ago
- Some ideas on making Bags into Git repositories☆16Dec 23, 2014Updated 11 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆70Feb 13, 2019Updated 7 years ago
- Go package for using Apache Tika☆251Apr 17, 2025Updated last year
- General Architecture for Text Engineering☆50Mar 23, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EEA ElasticSearch RDF River Plugin☆64Dec 14, 2021Updated 4 years ago
- This is a REST Server endpoint built using Flask and Python.☆24Nov 16, 2022Updated 3 years ago
- A JRuby command line application and library for Apache Tika to extract text and metadata from files of various formats.☆54May 1, 2025Updated 11 months ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 9 years ago
- Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate…☆19Sep 16, 2014Updated 11 years ago
- Display CKAN resource views on dataset and home pages☆10Aug 8, 2017Updated 8 years ago
- Allow anyone with a modern browser to stream a 1GB, 10GB, 100GB, or 1TB file over the Internet and into a happy home.☆32Oct 7, 2018Updated 7 years ago
- --DEPRECATED--. Use other top level repository under IntellectualHeaven.☆42Jan 30, 2015Updated 11 years ago
- Python scripts for interacting with the hypothes.is API☆49Jun 19, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This project has been archived and is no longer being developed or supported. The Curator's Workbench is an extensible digital collectio…☆24Jun 25, 2020Updated 5 years ago
- Arduino Library for the MS5803-14BA underwater pressure/depth sensor☆10Oct 19, 2023Updated 2 years ago
- Simplified version of a common crawl fetcher☆17Dec 24, 2025Updated 3 months ago
- Core libraries by the PRImA Research Lab☆16Jul 30, 2024Updated last year
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆277Oct 9, 2022Updated 3 years ago
- CKAN Geospatial ResourceView☆16Sep 12, 2022Updated 3 years ago
- An evil web server.☆13May 9, 2015Updated 10 years ago
- The nginx module to invalidate complete cache zone☆11Jul 1, 2020Updated 5 years ago
- Java Wiktionary Library☆60Nov 19, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- In the Django Authentication package is that all users use the same model/profile. This can be a drawback if you have lots of users or yo…☆25Feb 6, 2016Updated 10 years ago
- A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for image…☆21Jun 18, 2024Updated last year
- Add editing UI and other power-user features to Datasette.☆14Mar 4, 2023Updated 3 years ago
- python library for working with IIIF Image and Presentation APIs☆20Updated this week
- Download GitHub repositories☆12May 10, 2025Updated 11 months ago
- command-line tool to extract taxonomies from Wikidata☆131Jun 19, 2019Updated 6 years ago
- Express middleware for querying our graphql server built with graph.ql☆13Apr 29, 2018Updated 7 years ago