TOSIT-IO / TDP
Main TDP repository
☆56Updated 3 weeks ago
Alternatives and similar repositories for TDP
Users that are interested in TDP are comparing it to the libraries listed below
Sorting:
- Ansible collection to deploy the components of TDP☆21Updated this week
- Vagrant / Ansible environment to deploy a local TDP cluster☆20Updated this week
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated this week
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆37Updated 2 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated this week
- A kubernetes operator for Apache NiFi☆34Updated last week
- Copy Hive tables definitions to Compute Cluster, while still using Storage on original cluster☆11Updated this week
- ☆40Updated 2 years ago
- A Python package to submit and manage Apache Spark applications on Kubernetes.☆41Updated last month
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆62Updated this week
- REST API for Apache Spark on K8S or YARN☆98Updated this week
- Kafka Connector for Iceberg tables☆16Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆122Updated last week
- ☆53Updated last week
- ☆56Updated this week
- ☆19Updated 2 years ago
- ☆46Updated 2 weeks ago
- A simple Spark-powered ETL framework that just works 🍺☆181Updated last week
- An Ansible collection for lifecycle and management of Cloudera CDP Private Cloud resources on bare metal, IaaS, and PaaS.☆34Updated this week
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆52Updated 2 years ago
- ☆10Updated last year
- EverythingApacheNiFi☆110Updated last year
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆54Updated 2 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆77Updated last month
- Docker envinroment to stream data from Kafka to Iceberg tables☆28Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- ☆23Updated 8 months ago
- Tutorial on how to setup Trino and Apache Ranger using docker☆41Updated 9 months ago
- Setup for running Trino with Hive Metastore on Kubernetes☆100Updated 2 years ago