10xfuturetechnologies / kafka-connect-icebergLinks
Kafka Connector for Iceberg tables
☆16Updated 2 years ago
Alternatives and similar repositories for kafka-connect-iceberg
Users that are interested in kafka-connect-iceberg are comparing it to the libraries listed below
Sorting:
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆91Updated 2 months ago
- ☆80Updated 3 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆80Updated 3 months ago
- BigQuery connector for Apache Flink☆32Updated 3 weeks ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated last week
- ☆58Updated last week
- A library that brings useful functions from various modern database management systems to Apache Spark☆60Updated last year
- Extensible streaming ingestion pipeline on top of Apache Spark☆45Updated 2 weeks ago
- ☆40Updated 2 years ago
- Avro SerDe for Apache Spark structured APIs.☆235Updated last month
- Flowchart for debugging Spark applications☆106Updated 10 months ago
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- Multi-hop declarative data pipelines☆117Updated last week
- A testing framework for Trino☆26Updated 4 months ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 7 months ago
- Helm charts for Trino and Trino Gateway☆171Updated last week
- Kubernetes Helm Chart to deploy Apache Atlas☆16Updated 4 years ago
- Magic to help Spark pipelines upgrade☆35Updated 10 months ago
- Spark Structured Streaming State Tools☆34Updated 5 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆76Updated last year
- Setup for running Trino with Hive Metastore on Kubernetes☆102Updated 2 years ago
- A Table format agnostic data sharing framework☆38Updated last year
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!☆231Updated 6 months ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 5 years ago
- A simple Spark-powered ETL framework that just works 🍺☆182Updated this week
- REST API for Apache Spark on K8S or YARN☆98Updated last month
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Updated 8 months ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year