frictionlessdata / frictionless-py
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
☆743 Updated this week
Alternatives and similar repositories for frictionless-py:
Users who are interested in frictionless-py compare it to the libraries listed below:
- A Python library for working with Table Schema. ☆262 Updated 3 months ago
- Python library for reading and writing tabular data via streams. ☆237 Updated 3 years ago
- A Python library for working with Data Packages. ☆192 Updated last year
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,032 Updated this week
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.… ☆516 Updated this week
- Clean APIs for data cleaning. Python implementation of R package Janitor ☆1,402 Updated this week
- Quilt is a data mesh for connecting people with actionable data ☆1,331 Updated this week
- DataFlows is a simple, intuitive, lightweight framework for building data processing flows in Python. ☆200 Updated 3 months ago
- Python Extract Transform and Load Tables of Data ☆1,262 Updated 10 months ago
- Writes the Singer format from Python ☆553 Updated 5 months ago
- Brushing and linking for big data ☆956 Updated last week
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ☆527 Updated last week
- A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews ☆1,173 Updated this week
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di… ☆1,279 Updated last week
- 🐳 The stupidly simple CLI workspace for your data warehouse. ☆727 Updated 2 years ago
- A web frontend for scheduling Jupyter notebook reports ☆252 Updated 3 months ago
- Fast Datagrid widget for the Jupyter Notebook and JupyterLab ☆600 Updated 2 months ago
- The easy way to write your own flavor of Pandas ☆301 Updated 3 weeks ago
- A library for recording and reading data in notebooks. ☆286 Updated 2 years ago
- A light-weight, flexible, and expressive statistical data testing library ☆3,665 Updated this week
- Framework for processing data packages in pipelines of modular components. ☆121 Updated last month
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 Updated 4 years ago
- Pandas DataFrames as Interactive DataTables ☆837 Updated this week
- A library for defensive data analysis. ☆501 Updated 5 years ago
- Template Language for SQL with Automatic Bind Parameter Extraction ☆829 Updated 11 months ago
- IPython/Jupyter notebook module for Vega and Vega-Lite ☆377 Updated last week
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun… ☆215 Updated 3 years ago
- High-level tools to simplify visualization in Python. ☆864 Updated 3 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆986 Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆627 Updated last week