metinsenturk / flat_tableLinks
An extention to json_normalize() in pandas
☆27Updated 9 months ago
Alternatives and similar repositories for flat_table
Users that are interested in flat_table are comparing it to the libraries listed below
Sorting:
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features☆234Updated 2 years ago
- Code examples showing flow deployment to various types of infrastructure☆110Updated 3 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 3 years ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆196Updated 2 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 4 years ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Dask integration for Snowflake☆30Updated 6 months ago
- Repository to maintain infrastructure to automate Data Workflows☆35Updated 4 years ago
- Type System for Data Analysis in Python☆216Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆145Updated 3 months ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆30Updated 3 years ago
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- Tools for making Prefect work better for typical data science workflows☆18Updated 3 years ago
- Coming soon☆62Updated 2 years ago
- This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.☆57Updated 5 years ago
- JupyterHub extension for ContainDS Dashboards☆201Updated last year
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)☆35Updated 3 years ago
- A Python DB-API and SQLAlchemy dialect to Google Spreasheets☆225Updated 3 years ago
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 5 years ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆139Updated last week
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated last week
- Docker images for dask☆244Updated this week
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆155Updated 4 months ago
- Cloud-agnostic Python API☆60Updated last year
- Build and deploy a serverless data pipeline on AWS with no effort.☆110Updated 2 years ago
- A frictionless integrated platform for notebook☆82Updated 3 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆286Updated 3 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆84Updated 4 months ago