canelmas / kafka-connect-field-and-time-partitionerLinks
Kafka Connect Store Partitioner by custom fields and time
☆40Updated 3 years ago
Alternatives and similar repositories for kafka-connect-field-and-time-partitioner
Users that are interested in kafka-connect-field-and-time-partitioner are comparing it to the libraries listed below
Sorting:
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- Aiven's collection of Single Message Transformations (SMTs) for Apache Kafka Connect☆83Updated 2 weeks ago
- AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry…☆143Updated last week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated last week
- Snowflake Kafka Connector (Sink Connector)☆159Updated this week
- Setup for running Trino with Hive Metastore on Kubernetes☆103Updated 3 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆225Updated 6 months ago
- A Helm chart to install Apache Airflow on Kubernetes☆286Updated this week
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆194Updated last week
- Common Transforms for Kafka Connect.☆167Updated 2 months ago
- ☆80Updated 4 months ago
- Experiments and demonstrations of AVRO, Protobuf serialisation☆61Updated 2 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Performant Redshift data source for Apache Spark☆142Updated 2 months ago
- Pylint plugin for static code analysis on Airflow code☆96Updated 4 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆77Updated 6 years ago
- Continuously synchronize directories from remote object store to local filesystem☆106Updated 6 months ago
- Spark on Kubernetes using Helm☆34Updated 5 years ago
- Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.☆146Updated last year
- Multiple node presto cluster on docker container☆125Updated 3 years ago
- Performance optimization for Spark running on Kubernetes☆90Updated 5 years ago
- Grafana dashboards and StatsD exporter config for Airflow monitoring☆286Updated last year
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 3 months ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 3 weeks ago
- Spark runtime on AWS Lambda☆109Updated 3 weeks ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆38Updated 7 months ago
- Bare minimal Airflow on Kubernetes (Local, EKS, AKS)☆53Updated 5 years ago
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆159Updated this week
- A Python client for managing connectors using the Kafka Connect API.☆12Updated last year
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆109Updated 3 months ago