Teradata / docker-imagesLinks
Docker images used internally by various Teradata projects for automation, testing, etc
☆40Updated 7 years ago
Alternatives and similar repositories for docker-images
Users that are interested in docker-images are comparing it to the libraries listed below
Sorting:
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆90Updated last year
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆84Updated 5 years ago
- A tool to install, configure and manage Presto installations☆170Updated 2 years ago
- A testing framework for Presto☆63Updated 3 months ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago
- Port of TPC-DS dsdgen to Java☆50Updated last year
- Dockerized HDP Cluster☆84Updated 7 years ago
- Convert a CSV fle to ORCFile☆26Updated 6 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 2 months ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆47Updated 6 years ago
- The SpliceSQL Engine☆170Updated 2 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- ☆63Updated 5 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆59Updated 6 years ago
- Apache Drill Dialect for SQL Alchemy☆54Updated 2 months ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- Ansible playbooks to construct distributed computing environments☆62Updated 4 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated this week
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Splittable Gzip codec for Hadoop☆72Updated last week
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 4 months ago
- Groovy client library for Apache Ambari's REST API☆20Updated 4 years ago