frictionlessdata / frictionless-py
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
☆779 Updated 2 months ago
Alternatives and similar repositories for frictionless-py
Users who are interested in frictionless-py are comparing it to the libraries listed below.
- A Python library for working with Table Schema. ☆264 Updated 10 months ago
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python. ☆216 Updated 5 months ago
- Python library for reading and writing tabular data via streams. ☆238 Updated 4 years ago
- A Python library for working with Data Packages. ☆190 Updated last year
- A validation library for Pandas data frames using user-friendly schemas ☆193 Updated 2 years ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,055 Updated 3 weeks ago
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.… ☆544 Updated last month
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆226 Updated 5 years ago
- Test-Driven Data Analysis Functions ☆302 Updated last month
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ☆604 Updated last week
- Tools for test driven data-wrangling and data validation. ☆294 Updated 3 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆285 Updated 3 years ago
- SQL GUI for JupyterLab ☆430 Updated 2 years ago
- Python Extract Transform and Load Tables of Data ☆1,289 Updated last month
- Writes the Singer format from Python ☆572 Updated this week
- A web frontend for scheduling Jupyter notebook reports ☆254 Updated 10 months ago
- The easy way to write your own flavor of Pandas ☆309 Updated this week
- Quilt is a data mesh for connecting people with actionable data ☆1,348 Updated this week
- Framework for processing data packages in pipelines of modular components. ☆121 Updated 3 months ago
- Easy pipelines for pandas DataFrames. ☆719 Updated last week
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast. ☆198 Updated 3 months ago
- Immutable and statically-typeable DataFrames with runtime type and data validation ☆475 Updated this week
- Python interface to SDMX ☆131 Updated last year
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet ☆197 Updated 2 years ago
- Type System for Data Analysis in Python ☆213 Updated 8 months ago
- SQLAlchemy driver for DuckDB ☆463 Updated this week
- A federated, open-source data catalog for all your big data and small data ☆563 Updated 2 weeks ago
- sidetable builds simple but useful summary tables of your data ☆393 Updated 2 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆646 Updated last week
- A command line tool to easily add an ethics checklist to your data science projects. ☆301 Updated 3 weeks ago