metinsenturk / flat_table
An extension to json_normalize() in pandas
☆27Updated 5 months ago
Alternatives and similar repositories for flat_table
Users that are interested in flat_table are comparing it to the libraries listed below
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
- Repository to maintain infrastructure to automate Data Workflows☆35Updated 4 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.☆57Updated 5 years ago
- Code examples showing flow deployment to various types of infrastructure☆109Updated 2 years ago
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 5 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆145Updated 3 weeks ago
- Dagster scikit-learn pipeline example.☆45Updated 2 years ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆197Updated 2 years ago
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)☆35Updated 3 years ago
- The DBT of ML: Aligned describes data dependencies in ML systems and reduces technical data debt☆60Updated 2 weeks ago
- A small Python module containing quick utility functions for standard ETL processes.☆36Updated last month
- Tools for making Prefect work better for typical data science workflows☆18Updated 3 years ago
- Apache Avro <-> pandas DataFrame☆138Updated last month
- Dask integration for Snowflake☆30Updated last month
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆43Updated this week
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.☆28Updated 2 years ago
- Coming soon☆62Updated last year
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆29Updated 2 years ago
- Type System for Data Analysis in Python☆213Updated 7 months ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆81Updated last week
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆137Updated last week
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- A data modelling layer built on top of polars and pydantic☆198Updated 2 years ago
- A template repository with all the fundamentals needed to develop and deploy a Python data-processing routine for Prefect pipelines.☆20Updated 3 years ago
- A Python package that parses sql and converts it to ibis expressions☆55Updated last year
- SQL upsert using pandas DataFrames for PostgreSQL, SQLite and MySQL with extra features☆231Updated last year
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun…☆216Updated 4 years ago