TOSIT-IO / tdp-getting-started
Vagrant / Ansible environment to deploy a local TDP cluster
☆20Updated last week
Alternatives and similar repositories for tdp-getting-started:
Users that are interested in tdp-getting-started are comparing it to the libraries listed below
- Ansible collection to deploy the components of TDP☆21Updated this week
- Main TDP repository☆59Updated 2 months ago
- Trino connectors for accessing APIs with an OpenAPI spec☆31Updated last week
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Updated 2 weeks ago
- Copy Hive tables definitions to Compute Cluster, while still using Storage on original cluster☆11Updated last week
- A kubernetes CRD and controller to manage Flink jobs running on your any Flink Job Manager☆8Updated 3 months ago
- Presto Trino with Apache Hive Postgres metastore☆40Updated 6 months ago
- BigQuery connector for Apache Flink☆29Updated 2 weeks ago
- Datagenerator for Data Services☆16Updated 2 months ago
- A testing framework for Trino☆26Updated 3 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆120Updated this week
- Storage connector for Trino☆105Updated this week
- CDC with NiFi, Kafka Connect, Flink SQL, Cloudera Data in Motion☆12Updated last year
- ☆40Updated last year
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Updated last year
- ☆25Updated 6 months ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆19Updated 2 years ago
- Tutorial on how to setup Trino and Apache Ranger using docker☆41Updated 7 months ago
- Is using KoP (Kafka-On-Pulsar) a good idea? Use the scenarios implemented in this repository to check whether Pulsar with KoP enabled is …☆10Updated 2 years ago
- GetInData Helm Charts repository☆12Updated 2 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Updated 5 years ago
- ☆27Updated 2 months ago
- ☆56Updated this week
- Ingest JSON records from Kafka to multiple tables in the database using the DataStax Apache Kafka Connector☆13Updated 2 years ago
- Db2 JDBC connector for Trino☆18Updated 2 years ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- ☆38Updated 9 months ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆62Updated last year
- Pulsar Heartbeat monitors Pulsar cluster availability, tracks latency of Pulsar message pubsub, and reports failures of the Pulsar cluste…☆24Updated last week