Teradata / docker-imagesLinks
Docker images used internally by various Teradata projects for automation, testing, etc
☆39Updated 8 years ago
Alternatives and similar repositories for docker-images
Users that are interested in docker-images are comparing it to the libraries listed below
Sorting:
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 6 years ago
- Dockerized HDP Cluster☆84Updated 8 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆46Updated 6 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 9 years ago
- A testing framework for Presto☆62Updated 9 months ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 10 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- Schema Registry integration for Apache Spark☆40Updated 3 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Updated 6 years ago
- A tool to install, configure and manage Presto installations☆171Updated 3 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- Port of TPC-DS dsdgen to Java☆50Updated last year
- A library for Spark DataFrame using MinIO Select API☆99Updated 6 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆59Updated 9 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆84Updated 5 years ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 3 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- Ansible playbooks to construct distributed computing environments☆62Updated 4 years ago
- Quickly deploy Hadoop with the help of Ansible and Apache Ambari☆38Updated 10 years ago
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Cask Hydrator Plugins Repository☆68Updated last month
- A Spark datasource for the HadoopOffice library☆37Updated 4 months ago
- Docker image for Apache Spark☆76Updated 6 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆20Updated 8 years ago
- ☆39Updated 6 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago
- Python client for Spark Jobserver Rest API☆40Updated 5 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆161Updated 3 years ago