metinsenturk / flat_table
An extension to json_normalize() in pandas
☆27Updated 4 years ago
Related projects
Alternatives and complementary repositories for flat_table
- dagster scikit-learn pipeline example.☆43Updated last year
- Dask integration for Snowflake☆30Updated 4 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆37Updated this week
- Repository to maintain infrastructure to automate Data Workflows☆34Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago
- Tools for making Prefect work better for typical data science workflows☆19Updated 2 years ago
- ☆109Updated last year
- A collection of python utility functions☆12Updated 4 months ago
- PySpark schema generator☆38Updated last year
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 4 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆28Updated last year
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- A simple script to help schedule Jupyter Notebook execution and storing of the results using Papermill☆27Updated 5 years ago
- Prefect API Authentication/Authorization Proxy for on-premises deployments☆35Updated last week
- Read Delta tables without any Spark☆47Updated 8 months ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆31Updated 3 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆122Updated 3 years ago
- pandabase links DataFrames to SQL databases using primary keys.☆21Updated 4 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆33Updated last week
- Write your dbt models using Ibis☆52Updated 2 weeks ago
- ☆29Updated 10 months ago
- An extension to add Prefect flow visualizations into your Sphinx documentation.☆14Updated 2 years ago
- A curated list of dagster code snippets for data engineers☆50Updated 8 months ago
- Cluster tools for running Dask on Databricks☆13Updated 5 months ago
- Parse dbt artifacts and search dbt models with Algolia☆52Updated 3 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 5 months ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆134Updated 3 weeks ago
- Build your feature store with macros right within your dbt repository☆37Updated last year