frictionlessdata / frictionless-pyLinks
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
☆765Updated last week
Alternatives and similar repositories for frictionless-py
Users that are interested in frictionless-py are comparing it to the libraries listed below
Sorting:
- A Python library for working with Table Schema.☆264Updated 8 months ago
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.☆214Updated 2 months ago
- A Python library for working with Data Packages.☆192Updated last year
- Python library for reading and writing tabular data via streams.☆238Updated 4 years ago
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.…☆535Updated last week
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,048Updated last month
- Python Extract Transform and Load Tables of Data☆1,280Updated 3 weeks ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,018Updated last year
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆226Updated 5 years ago
- Test-Driven Data Analysis Functions☆299Updated last week
- A validation library for Pandas data frames using user-friendly schemas☆192Updated 2 years ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆197Updated 2 years ago
- Tools for test driven data-wrangling and data validation.☆294Updated 3 years ago
- Writes the Singer format from Python☆570Updated last month
- Tools for generating CSV and other flat versions of the structured data☆108Updated 3 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,440Updated this week
- The easy way to write your own flavor of Pandas☆307Updated last month
- Immutable and statically-typeable DataFrames with runtime type and data validation☆471Updated 3 weeks ago
- Framework for processing data packages in pipelines of modular components.☆121Updated last month
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆639Updated this week
- Making it easy to query APIs via SQL☆431Updated last week
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di…☆1,301Updated last week
- SQL GUI for JupyterLab☆428Updated 2 years ago
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!☆586Updated last week
- SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features☆230Updated last year
- sidetable builds simple but useful summary tables of your data☆391Updated 2 years ago
- A web frontend for scheduling Jupyter notebook reports☆253Updated 8 months ago
- Python interface to SDMX☆132Updated last year
- 🐳 The stupidly simple CLI workspace for your data warehouse.☆727Updated 2 years ago
- Easy pipelines for pandas DataFrames.☆720Updated 3 weeks ago