metinsenturk / flat_table
An extension to json_normalize() in pandas
☆27 Updated 4 years ago
Alternatives and similar repositories for flat_table:
Users who are interested in flat_table are comparing it to the libraries listed below.
- Repository to maintain infrastructure to automate Data Workflows ☆34 Updated 3 years ago
- Tools for making Prefect work better for typical data science workflows ☆19 Updated 3 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python. ☆107 Updated this week
- A template repository with all the fundamentals needed to develop and deploy a Python data-processing routine for Prefect pipelines. ☆20 Updated 2 years ago
- A Python package that parses SQL and interprets it as methods that act upon existing pandas (or other types of) DataFrames that have been… ☆98 Updated 3 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs ☆20 Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆38 Updated 2 years ago
- A collection of Python utility functions ☆11 Updated 9 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated 11 months ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆28 Updated 2 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries ☆30 Updated last year
- asyncio bridge to the duckdb library ☆39 Updated 2 years ago
- A very simple "hello world" project for deploying Prefect 2 to a Docker container on Google Compute Engine. ☆11 Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows. ☆80 Updated 10 months ago
- A place to provide Coiled feedback ☆18 Updated 3 weeks ago
- The easiest way to integrate Kedro and Great Expectations ☆53 Updated 2 years ago
- dagster scikit-learn pipeline example. ☆45 Updated 2 years ago
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data. ☆21 Updated 3 years ago
- Prefect API Authentication/Authorization Proxy for on-premises deployments ☆38 Updated 3 months ago
- A small Python module containing quick utility functions for standard ETL processes. ☆34 Updated this week
- Running Meltano ELT on AWS Batch, infra with Terraform ☆19 Updated 3 years ago
- Data pipelines from re-usable components ☆108 Updated last year
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet ☆195 Updated last year
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆43 Updated this week
- Build and deploy a serverless data pipeline on AWS with no effort. ☆111 Updated 2 years ago
- Read Delta tables without any Spark ☆47 Updated last year
- Prefect integrations for working with Docker ☆43 Updated 11 months ago
- Comparing Polars to Pandas and a small introduction ☆43 Updated 3 years ago
- This library can convert a pydantic class to an Avro schema or generate Python code from an Avro schema. ☆70 Updated 3 weeks ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS" ☆31 Updated 4 years ago