frictionlessdata / frictionless-py
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
☆745Updated last month
Alternatives and similar repositories for frictionless-py:
Users that are interested in frictionless-py are comparing it to the libraries listed below
- A Python library for working with Table Schema.☆263Updated 5 months ago
- Python library for reading and writing tabular data via streams.☆237Updated 3 years ago
- A Python library for working with Data Packages.☆192Updated last year
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.☆203Updated last month
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.…☆527Updated 3 weeks ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆635Updated last week
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!☆565Updated this week
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,043Updated last week
- Framework for processing data packages in pipelines of modular components.☆121Updated 3 months ago
- SQL GUI for JupyterLab☆423Updated 2 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆224Updated 4 years ago
- Writes the Singer format from Python☆561Updated last month
- Test-Driven Data Analysis Functions☆299Updated last month
- A validation library for Pandas data frames using user-friendly schemas☆191Updated 2 years ago
- The easy way to write your own flavor of Pandas☆307Updated last month
- A library for defensive data analysis.☆500Updated 5 years ago
- Immutable and statically-typeable DataFrames with runtime type and data validation☆459Updated this week
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,002Updated last year
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆389Updated last year
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,414Updated this week
- A web frontend for scheduling Jupyter notebook reports☆252Updated 5 months ago
- 🐳 The stupidly simple CLI workspace for your data warehouse.☆726Updated 2 years ago
- A federated, open-source data catalog for all your big data and small data☆539Updated this week
- Easy pipelines for pandas DataFrames.☆719Updated 6 months ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆1,578Updated this week
- Tools for diffing and merging of Jupyter notebooks.☆2,741Updated 7 months ago
- sidetable builds simple but useful summary tables of your data☆389Updated 2 years ago
- A light-weight, flexible, and expressive statistical data testing library☆3,788Updated this week
- Quilt is a data mesh for connecting people with actionable data☆1,338Updated this week
- Python Extract Transform and Load Tables of Data☆1,266Updated last week