canelmas / kafka-connect-field-and-time-partitioner
Kafka Connect Store Partitioner by custom fields and time
☆40 Updated 3 years ago
Alternatives and similar repositories for kafka-connect-field-and-time-partitioner:
Users who are interested in kafka-connect-field-and-time-partitioner are comparing it to the libraries listed below.
- Aiven's collection of Single Message Transformations (SMTs) for Apache Kafka Connect ☆76 Updated 3 weeks ago
- AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry… ☆136 Updated 2 months ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆66 Updated 3 years ago
- ☆79 Updated this week
- ☆40 Updated last year
- Spark history server Helm Chart ☆20 Updated last year
- Aiven's S3 Sink Connector for Apache Kafka® ☆69 Updated 7 months ago
- Spark ETL example processing New York taxi rides public dataset on EKS ☆44 Updated 2 years ago
- ☆53 Updated 8 months ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆35 Updated 2 months ago
- Pylint plugin for static code analysis on Airflow code ☆93 Updated 4 years ago
- ☆26 Updated 4 years ago
- ☆29 Updated 2 weeks ago
- Common Transforms for Kafka Connect. ☆157 Updated 8 months ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark. ☆75 Updated last year
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics ☆64 Updated last year
- The Internals of Spark on Kubernetes ☆71 Updated 2 years ago
- A pyspark lib to validate data quality ☆18 Updated 2 years ago
- Docker image to submit Spark applications ☆38 Updated 7 years ago
- Setup for running Trino with Hive Metastore on Kubernetes ☆101 Updated 2 years ago
- Oozie Workflow to Airflow DAGs migration tool ☆87 Updated last month
- Examples and custom spark images for working with the spark-on-k8s operator on AWS ☆27 Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆26 Updated 8 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 Updated last year
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores. ☆20 Updated 3 years ago
- Helm charts for Trino and Trino Gateway ☆162 Updated last week
- Presto Trino with Apache Hive Postgres metastore ☆41 Updated 7 months ago
- A demonstration of dashboards for monitoring Kafka Streams applications. ☆37 Updated last year
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work ☆47 Updated 2 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆43 Updated 2 years ago