USCDataScience / tika-dockers
A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for images and video
☆21Updated 10 months ago
Alternatives and similar repositories for tika-dockers:
Users that are interested in tika-dockers are comparing it to the libraries listed below
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 3 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated last year
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14Updated 7 years ago
- Demonstration of searching PDF document with Solr, Tika, and Tesseract☆31Updated 6 months ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆26Updated last month
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- Highly performant, lightweight framework for linked data processing. Supports RDFa, JSON-LD, RDF/XML and plain text formats, runs on Andr…☆52Updated 2 years ago
- Advanced desktop search/corpus exploration prototype☆21Updated 3 years ago
- Simple search results with Solr and EmberJS☆58Updated 6 years ago
- A JDBC driver that takes data from SPARQL endpoints or RDF graphs☆25Updated 7 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Apache OpenNLP Sandbox☆42Updated this week
- Text similarity based on Word2Vec vectors.☆11Updated 8 years ago
- ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (image…☆95Updated 6 years ago
- a pure javascript frontend for ElasticSearch search indices.☆78Updated 7 years ago
- Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum☆17Updated 2 years ago
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents☆29Updated 6 years ago
- SOLR bulk indexing utility for the command line.☆45Updated 3 weeks ago
- Sensefy is a federated enterprise semantic search framework built on Apache ManifoldCF, Apache Solr and Apache Stanbol. Development is sp…☆15Updated 2 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆37Updated last year
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆17Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 7 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS☆86Updated 8 years ago
- This is a REST Server endpoint built using Flask and Python.☆24Updated 2 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 4 years ago
- Apache NiFi NLP Processor☆18Updated last year
- Simple RESTful API server running your own machine translation model. Docker image modified from mbartoli/easy-smt☆11Updated 5 years ago
- A web tool enabling authorship and download of RDF, and RDF visualization in Linked Open Data☆37Updated 5 years ago
- Open Source, Distributed, Big Data Enterprise Search Engine☆69Updated last month