canelmas / kafka-connect-field-and-time-partitionerLinks
Kafka Connect Store Partitioner by custom fields and time
☆40Updated 3 years ago
Alternatives and similar repositories for kafka-connect-field-and-time-partitioner
Users that are interested in kafka-connect-field-and-time-partitioner are comparing it to the libraries listed below
Sorting:
- Aiven's collection of Single Message Transformations (SMTs) for Apache Kafka Connect☆84Updated last week
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- Common Transforms for Kafka Connect.☆168Updated 2 months ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- ☆80Updated 5 months ago
- AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry…☆143Updated last week
- Setup for running Trino with Hive Metastore on Kubernetes☆103Updated 3 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆225Updated 6 months ago
- Snowflake Kafka Connector (Sink Connector)☆160Updated this week
- Helm charts for Trino and Trino Gateway☆180Updated 2 weeks ago
- Experiments and demonstrations of AVRO, Protobuf serialisation☆61Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- A Python client for managing connectors using the Kafka Connect API.☆12Updated 2 weeks ago
- ☆25Updated last year
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆194Updated this week
- Aiven's S3 Sink Connector for Apache Kafka®☆71Updated last year
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆77Updated 6 years ago
- PySpark data-pipeline testing and CICD☆28Updated 4 years ago
- Visualize dependencies between Airflow DAGs☆49Updated 4 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆39Updated 7 months ago
- Performant Redshift data source for Apache Spark☆142Updated 3 months ago
- Spark runtime on AWS Lambda☆110Updated last month
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆30Updated 2 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Updated last year
- Airflow Backfill UI based plugin for existing / new Airflow environment☆65Updated 4 years ago
- Pylint plugin for static code analysis on Airflow code☆96Updated 4 years ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆168Updated 9 months ago
- Continuously synchronize directories from remote object store to local filesystem☆107Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆286Updated this week