frictionlessdata / frictionless-py
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
☆784 Updated 2 months ago
Alternatives and similar repositories for frictionless-py
Users that are interested in frictionless-py are comparing it to the libraries listed below
- A Python library for working with Table Schema. ☆264 Updated 11 months ago
- Python library for reading and writing tabular data via streams. ☆238 Updated 4 years ago
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python. ☆217 Updated 5 months ago
- A Python library for working with Data Packages. ☆191 Updated last year
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.… ☆548 Updated 2 weeks ago
- Python Extract Transform and Load Tables of Data ☆1,290 Updated 2 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆1,028 Updated last year
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆286 Updated 3 years ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,057 Updated last month
- A validation library for Pandas data frames using user-friendly schemas ☆193 Updated 2 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆226 Updated 5 years ago
- A list of free data matching and record linkage software. ☆393 Updated last year
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di… ☆1,311 Updated 2 weeks ago
- Flatten JSON in Python ☆551 Updated last year
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast. ☆200 Updated 4 months ago
- Framework for processing data packages in pipelines of modular components. ☆121 Updated 4 months ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet ☆197 Updated 2 years ago
- Clean APIs for data cleaning. Python implementation of R package Janitor ☆1,462 Updated this week
- Test-Driven Data Analysis Functions ☆302 Updated last month
- Python interface to SDMX ☆131 Updated last year
- Writes the Singer format from Python ☆573 Updated this week
- Extract Transform Load for Python 3.5+ ☆1,604 Updated 2 years ago
- SQL GUI for JupyterLab ☆430 Updated 2 years ago
- Making it easy to query APIs via SQL ☆442 Updated last week
- Simplifies use of the Dedupe library via Pandas ☆136 Updated 2 years ago
- IPython/Jupyter notebook module for Vega and Vega-Lite ☆383 Updated 6 months ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py ☆390 Updated 2 years ago
- Quilt is a data mesh for connecting people with actionable data ☆1,349 Updated this week
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ☆605 Updated 3 weeks ago
- Tools for test driven data-wrangling and data validation. ☆294 Updated 3 years ago