frictionlessdata / frictionless-pyLinks
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
☆749Updated last month
Alternatives and similar repositories for frictionless-py
Users that are interested in frictionless-py are comparing it to the libraries listed below
Sorting:
- A Python library for working with Table Schema.☆264Updated 6 months ago
- A Python library for working with Data Packages.☆192Updated last year
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.☆203Updated 3 weeks ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,425Updated this week
- A validation library for Pandas data frames using user-friendly schemas☆192Updated 2 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆225Updated 4 years ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,048Updated 3 weeks ago
- Python Extract Transform and Load Tables of Data☆1,270Updated 3 weeks ago
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.…☆529Updated last month
- 🐳 The stupidly simple CLI workspace for your data warehouse.☆728Updated 2 years ago
- Quilt is a data mesh for connecting people with actionable data☆1,342Updated last week
- Google BigQuery connector for pandas☆473Updated last week
- A library for defensive data analysis.☆500Updated 5 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆283Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,007Updated last year
- Writes the Singer format from Python☆562Updated 2 months ago
- Data Migration for the Blaze Project☆1,002Updated 2 years ago
- Tools for test driven data-wrangling and data validation.☆294Updated 3 years ago
- Framework for processing data packages in pipelines of modular components.☆121Updated 4 months ago
- Extract Transform Load for Python 3.5+☆1,591Updated 2 years ago
- The easy way to write your own flavor of Pandas☆307Updated last month
- SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features☆229Updated last year
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!☆574Updated this week
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved di…☆1,294Updated last month
- IPython/Jupyter notebook module for Vega and Vega-Lite☆381Updated 2 months ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆196Updated last year
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆1,603Updated last week
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,511Updated 6 months ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆390Updated 2 years ago
- Python interface to SDMX☆132Updated last year