vitoravancini / spark-data-mover
A CLI tool that uses Spark SQL to move data around
☆13 · Updated 5 years ago
Alternatives and similar repositories for spark-data-mover
Users that are interested in spark-data-mover are comparing it to the libraries listed below
- ☆28 · Updated 3 weeks ago
- Transporter for integrating OpenLineage with OpenMetadata ☆14 · Updated last month
- ELT With Airflow Helper - Classes and functions to make apache airflow life easier ☆12 · Updated last week
- dbt ksqlDB adapter ☆27 · Updated 2 years ago
- Dremio driver for Metabase BI ☆52 · Updated 8 months ago
- Execution of DBT models using Apache Airflow through Docker Compose ☆117 · Updated 2 years ago
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow. ☆202 · Updated last month
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆174 · Updated last year
- This repo helps bootstrap the infrastructures with a modern data stack on Google Cloud Platform using Terraform. ☆116 · Updated 3 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 · Updated last year
- Intelligent data generator for Apache Kafka. Generates streams of realistic data with support for cross-topic relationships, tombstoning,… ☆155 · Updated last year
- The go to demo for public and private dbt Learn ☆80 · Updated 3 months ago
- Library to convert DBT manifest metadata to Airflow tasks ☆48 · Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆163 · Updated 7 months ago
- New Generation Opensource Data Stack Demo ☆438 · Updated 2 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆155 · Updated last week
- dbt (data build tool) adapter for the Dremio ☆52 · Updated this week
- A generic ETL framework with Spark_SQL for transforming data by constructing pipelines with Yaml/Json/Xml ☆15 · Updated 6 months ago
- ☆152 · Updated last week
- A Database Change Management tool for Snowflake ☆576 · Updated last week
- Repository for the ActivitySchema spec and supporting materials ☆421 · Updated 2 years ago
- Visio stencils and artefacts related to data vault guru ☆49 · Updated 3 years ago
- The metrics layer for your data. Join us at https://metriql.com/slack ☆310 · Updated 2 years ago
- dbt support for database features which are not yet supported natively in dbt-core ☆157 · Updated last month
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple… ☆26 · Updated 4 years ago
- Quick Guides from Dremio on Several topics ☆73 · Updated this week
- ☆38 · Updated 3 years ago
- Redshift package for dbt (getdbt.com) ☆101 · Updated 6 months ago
- Create hadoop cluster in aws ec2 for development ☆11 · Updated 7 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase ☆131 · Updated 3 years ago