metinsenturk / flat_table
An extention to json_normalize() in pandas
☆27Updated 4 years ago
Alternatives and similar repositories for flat_table:
Users that are interested in flat_table are comparing it to the libraries listed below
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆112Updated 10 months ago
- A small Python module containing quick utility functions for standard ETL processes.☆34Updated this week
- Repository to maintain infrastructure to automate Data Workflows☆34Updated 3 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆106Updated this week
- Dask integration for Snowflake☆30Updated 3 months ago
- Fast iterative local development and testing of Apache Airflow workflows☆196Updated 2 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆42Updated this week
- Parse dbt artifacts and search dbt models with Algolia☆52Updated 3 years ago
- dagster scikit-learn pipeline example.☆44Updated last year
- Running Meltano ELT on AWS Batch, infra with Terraform☆19Updated 2 years ago
- ☆30Updated last year
- pandabase links DataFrames to SQL databases using primary keys.☆21Updated 4 years ago
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)☆35Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- An extension to add Prefect flow visualizations into you Sphinx documentation.☆13Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 9 months ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆31Updated 4 years ago
- Run dbt serverless in the Cloud (AWS)☆41Updated 5 years ago
- A Delta Lake reader for Dask☆48Updated 4 months ago
- PySpark schema generator☆41Updated last year
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆45Updated 11 months ago
- Singer.io Target for Snowflake - PipelineWise compatible☆51Updated 5 months ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆140Updated last year
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.☆27Updated 2 years ago
- A very simple "hello world" project for deploying Prefect 2 to a docker container on Google Compute Engine.☆11Updated 2 years ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago