datacamp / viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
β123Updated 3 years ago
Alternatives and similar repositories for viewflow:
Users that are interested in viewflow are comparing it to the libraries listed below
- π A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)β141Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β113Updated last year
- Tool to automate data quality checks on data pipelinesβ254Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.htmlβ61Updated 2 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).β122Updated 10 months ago
- Astronomer Core Docker Imagesβ106Updated 10 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.β167Updated last year
- dbt's adapter for dremioβ48Updated 2 years ago
- scaffold of Apache Airflow executing Docker containersβ85Updated 2 years ago
- Generate and Visualize Data Lineage from query historyβ322Updated last year
- Great Expectations Airflow operatorβ162Updated this week
- A repository of sample code to show data quality checking best practices using Airflow.β76Updated 2 years ago
- Data ingestion library for Amundsen to build graph and search indexβ205Updated last year
- A curated list of dagster code snippets for data engineersβ54Updated last year
- Making DAG construction easierβ260Updated last month
- Sample configuration to deploy a modern data platform.β88Updated 3 years ago
- Fast iterative local development and testing of Apache Airflow workflowsβ198Updated 3 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL databaseβ73Updated 3 years ago
- The metrics layer for your data. Join us at https://metriql.com/slackβ307Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.β107Updated last week
- Write python locally, execute SQL in your data warehouseβ270Updated 2 years ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooksβ50Updated 3 weeks ago
- dagster scikit-learn pipeline example.β44Updated 2 years ago
- Data Tools Subjective Listβ83Updated last year
- ETLy is an add-on dashboard service on top of Apache Airflow.β70Updated last year
- Data pipelines from re-usable componentsβ108Updated 2 years ago
- Make dbt docs and Apache Superset talk to one anotherβ142Updated 3 months ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.β79Updated last week
- The sane way of building a data layer in Airflowβ24Updated 5 years ago
- re_data - fix data issues before your users & CEO would discover them πβ98Updated 11 months ago