OpenDataAlex / etlTestLinks
Automated and tool agnostic data integration testing tool.
☆10Updated 3 years ago
Alternatives and similar repositories for etlTest
Users that are interested in etlTest are comparing it to the libraries listed below
Sorting:
- A Python library for working with Table Schema.☆264Updated last year
- Framework for processing data packages in pipelines of modular components.☆123Updated 7 months ago
- Python library for reading and writing tabular data via streams.☆238Updated 4 years ago
- The simplest way to use SQL in Python☆30Updated 2 years ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 11 months ago
- An example mini data warehouse for python project stats, template for new projects☆178Updated 5 years ago
- Data validation as a service. Project retired, got to the current one at frictionsless/repository☆69Updated 3 years ago
- Python package to access OLAP data sources.☆65Updated 3 years ago
- "1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook☆84Updated 3 years ago
- Tools for generating CSV and other flat versions of the structured data☆109Updated last month
- A python client library for the Stitch Import API☆44Updated 2 years ago
- Tough and flexible tools for data analysis, transformation, validation and movement.☆140Updated 2 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated 4 months ago
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆110Updated last year
- A data science Python library aimed at adding fuzz, noise and other issues to your data for testing purposes.☆30Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the co…☆86Updated 4 years ago
- Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.☆123Updated 2 weeks ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆196Updated 2 years ago
- OlaPy, an experimental OLAP engine based on Pandas☆109Updated 2 years ago
- Official repository for pygrametl - ETL programming in Python☆299Updated 4 months ago
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- Python for people data☆71Updated last year
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆31Updated 3 years ago
- CLI for creating databases for Data Quality Dashboards.☆19Updated 6 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆286Updated 3 years ago
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 5 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systems☆77Updated last year
- CLI tool for initiating dash boilerplate☆21Updated 8 years ago
- Optional extensions for petl based on third party libraries.☆44Updated 10 years ago