TOSIT-IO / TDP
Main TDP repository
☆59Updated last week
Alternatives and similar repositories for TDP:
Users who are interested in TDP are comparing it to the repositories listed below
- Ansible collection to deploy the components of TDP☆21Updated last week
- Vagrant / Ansible environment to deploy a local TDP cluster☆20Updated last month
- A Kubernetes operator for Apache NiFi☆34Updated last week
- Sparglim✨ makes PySpark apps configurable and Spark Connect Server deployment easier!☆37Updated last month
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated this week
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆61Updated this week
- REST API for Apache Spark on K8S or YARN☆98Updated this week
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆121Updated this week
- ☆40Updated last year
- ☆56Updated this week
- Minimal example to run Trino, MinIO, and Hive standalone metastore on Docker☆52Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- Monitoring and insights on your data lakehouse tables☆28Updated this week
- Apache NiFi Python Extensions☆22Updated 5 months ago
- CLI tool to bulk migrate tables from one catalog to another without a data copy☆77Updated 2 weeks ago
- Helm Charts to Deploy Apache Drill on Kubernetes☆17Updated last year
- Stackable Operator for Apache Airflow☆24Updated this week
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆52Updated 2 months ago
- Tutorial on how to set up Trino and Apache Ranger using Docker☆41Updated 9 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- Docker image for Apache Hive Metastore☆71Updated 2 years ago
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Ambari stack service for installing and managing Apache Airflow on an HDP cluster☆59Updated 6 years ago
- Copy Hive table definitions to a compute cluster while still using storage on the original cluster☆11Updated last week
- ☆19Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆58Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆120Updated 3 weeks ago
- An Ansible collection for lifecycle and management of Cloudera CDP Private Cloud resources on bare metal, IaaS, and PaaS.☆34Updated last week
- The NiFiKop NiFi Kubernetes operator makes it easy to run Apache NiFi on Kubernetes. Apache NiFi is a free, open-source solution that sup…☆129Updated 3 years ago
- ☆51Updated this week