liquibase / liquibase-databricksLinks
☆35Updated this week
Alternatives and similar repositories for liquibase-databricks
Users that are interested in liquibase-databricks are comparing it to the libraries listed below
Sorting:
- A COBOL parser and Mainframe/EBCDIC data source for Apache Spark☆157Updated 2 weeks ago
- Yet Another (Spark) ETL Framework☆21Updated 2 years ago
- Code snippets used in demos recorded for the blog.☆37Updated this week
- Kafka Connect REST connector☆112Updated 3 years ago
- Kafka Connector for Iceberg tables☆16Updated 2 years ago
- A Kafka Serde that reads and writes records from and to Blob storage (S3, Azure, Google) transparently.☆62Updated last month
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆97Updated last week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Updated 2 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆45Updated 4 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- A Table format agnostic data sharing framework☆42Updated last year
- Delta lake and filesystem helper methods☆51Updated last year
- ☆269Updated last year
- ☆81Updated 7 months ago
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆43Updated 2 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆123Updated last week
- A library that provides useful extensions to Apache Spark and PySpark.☆231Updated this week
- Snowflake Data Source for Apache Spark.☆230Updated last week
- Column-wise type annotations for pyspark DataFrames☆93Updated last week
- Utility functions and base classes for Kafka Streams applications☆34Updated this week
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 5 months ago
- Multi-hop declarative data pipelines☆122Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆94Updated 7 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆193Updated this week
- ☆63Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink☆96Updated last year
- Apache Hive Metastore as a Standalone server in Docker☆80Updated last year
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆71Updated this week
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆85Updated 5 months ago
- The Internals of Spark on Kubernetes☆72Updated 3 years ago