TrivadisPF / platys
A tool for generating docker-compose environments
☆23Updated this week
Alternatives and similar repositories for platys:
Users that are interested in platys are comparing it to the libraries listed below
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆52Updated 2 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated this week
- ☆15Updated 2 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆27Updated last year
- Yet Another (Spark) ETL Framework☆20Updated last year
- Kafka as your DataLake Demo☆11Updated 7 months ago
- GetInData Helm Charts repository☆12Updated 2 years ago
- A Data Mesh demo repository☆13Updated 6 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- This repository contains recipes for Apache Pinot.☆30Updated last month
- Apache NiFi Python Extensions☆22Updated 5 months ago
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Updated last year
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Updated 6 months ago
- A kubernetes operator for Apache NiFi☆34Updated last week
- Presentations and other resources.☆36Updated 4 years ago
- Trino connectors for accessing APIs with an OpenAPI spec☆37Updated this week
- Explore Apache Kafka data pipelines in Kubernetes.☆45Updated 2 months ago
- dbt (data build tool) adapter for the Dremio☆51Updated last week
- An Ansible collection for lifecycle and management of Cloudera CDP Private Cloud resources on bare metal, IaaS, and PaaS.☆34Updated last week
- The Data Product Descriptor Specification (DPDS) Repository☆77Updated 3 months ago
- Unity Catalog UI☆40Updated 7 months ago
- kafka-connect-jdbc system test based on testcontainers☆13Updated last year
- ☆29Updated 2 weeks ago
- A platform to manage the data product life cycle☆16Updated last week
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 8 months ago
- Presto Trino with Apache Hive Postgres metastore☆41Updated 7 months ago