goodwillpunning / hyperleaupLinks
Create and manipulate Tableau Hyper files from Apache Spark DataFrames and Spark SQL
☆31Updated last month
Alternatives and similar repositories for hyperleaup
Users that are interested in hyperleaup are comparing it to the libraries listed below
Sorting:
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated 2 years ago
- Great Expectations Airflow operator☆170Updated last week
- Spark app to merge different schemas☆23Updated 5 years ago
- All the basics to get a nice containerized dbt development environment☆58Updated 3 years ago
- This repository contains the dbt-glue adapter☆139Updated last month
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆92Updated this week
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 3 years ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆226Updated last week
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 3 years ago
- Data Product Portal created by Dataminded☆198Updated this week
- Delta Lake Documentation☆53Updated last year
- Utility functions for dbt projects running on Spark☆34Updated last month
- A curated list of resources about Snowflake☆256Updated last year
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.☆211Updated last month
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆45Updated 2 weeks ago
- A Python Library to support running data quality rules while the spark job is running⚡☆197Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆101Updated last year
- Python project template for Snowpark development☆80Updated 2 years ago
- Delta Lake examples☆238Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Updated last year
- Data engineering with dbt, published by Packt☆89Updated 5 months ago
- Data pipeline with dbt, Airflow, Great Expectations☆166Updated 4 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆182Updated last year
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆115Updated this week
- A Python API for Asynchronously Loading Data into Snowflake DB -☆68Updated 3 months ago
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆57Updated last year
- Make simple storing test results and visualisation of these in a BI dashboard☆52Updated last month
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Delta Lake helper methods. No Spark dependency.☆22Updated 3 weeks ago
- A bunch of hacks developed around dbt☆48Updated 6 years ago