metinsenturk / flat_tableLinks
An extention to json_normalize() in pandas
☆27Updated 6 months ago
Alternatives and similar repositories for flat_table
Users that are interested in flat_table are comparing it to the libraries listed below
Sorting:
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)☆35Updated 3 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆145Updated 2 weeks ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features☆231Updated last year
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆30Updated 2 years ago
- Apache Avro <-> pandas DataFrame☆138Updated 2 months ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated last month
- The Prefect API and backend☆246Updated 2 years ago
- Type System for Data Analysis in Python☆213Updated 8 months ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Code examples showing flow deployment to various types of infrastructure☆110Updated 2 years ago
- JupyterHub extension for ContainDS Dashboards☆201Updated last year
- DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters☆22Updated 3 years ago
- Docker images for dask☆243Updated 2 weeks ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 4 years ago
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.☆28Updated 2 years ago
- Prefect API Authentication/Authorization Proxy for on-premises deployments☆41Updated 6 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆44Updated this week
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆155Updated 3 weeks ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated last year
- Coming soon☆62Updated last year
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆60Updated 3 weeks ago
- Write python locally, execute SQL in your data warehouse☆269Updated 3 years ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆197Updated 2 years ago
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 5 years ago