llofberg / kafka-connect-s3-parquet
☆11Updated 6 years ago
Alternatives and similar repositories for kafka-connect-s3-parquet:
Users that are interested in kafka-connect-s3-parquet are comparing it to the libraries listed below
- ☆40Updated last year
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated 9 months ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆28Updated this week
- Spark history server Helm Chart☆19Updated 10 months ago
- A library for Spark DataFrame using MinIO Select API☆97Updated 5 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Deploy Presto on the cloud easily, using Terraform and Packer☆44Updated last year
- ☆62Updated 5 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated last year
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆48Updated 2 years ago
- Open Source Secret Provider plugin for the Kafka Connect framework☆46Updated 6 months ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆70Updated 11 months ago
- ☆22Updated 5 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated last year
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆65Updated 3 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆60Updated last year
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆154Updated last month
- Utility project for working with Kafka Connect.☆33Updated 5 months ago
- Open Source Kafka Connect Connector plugin repository built and maintained by Instaclustr☆12Updated last year
- ☆47Updated 5 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆13Updated last year
- Presto Trino with Apache Hive Postgres metastore☆38Updated 4 months ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Updated 6 years ago
- ☆79Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆71Updated 3 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Updated 6 years ago
- HDFS inotify Example☆22Updated last year