peerside / awesome-data-wrangling
A curated list of data wrangling resources
☆32 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for awesome-data-wrangling
- A monorepo of many Rill example projects ☆31 · Updated this week
- CLI for creating databases for Data Quality Dashboards. ☆19 · Updated 5 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆52 · Updated 3 weeks ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆140 · Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆27 · Updated 2 years ago
- ☆29 · Updated last week
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi… ☆42 · Updated 2 years ago
- SQL-based transforms compatible with Rasgo and PyRasgo ☆24 · Updated 7 months ago
- Postgres utility package for dbt (getdbt.com) ☆18 · Updated 3 years ago
- A CLI to build linked data cubes. ☆12 · Updated 3 weeks ago
- A curated list of dagster code snippets for data engineers ☆52 · Updated 8 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆122 · Updated 3 years ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S… ☆12 · Updated 4 months ago
- Helper code to interact with Rasgo via our SDK, PyRasgo ☆40 · Updated last year
- This is a compilation of Data Governance resources, examples, models and communities ☆10 · Updated 5 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆56 · Updated 2 years ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆30 · Updated last year
- A modern, enterprise-ready business intelligence web application. Unleash the value of your data. 📈 📉 📊 ☆31 · Updated last year
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED) ☆26 · Updated 2 years ago
- A maximum-strength name parser for record linkage. ☆34 · Updated 3 months ago
- dbt data models for facebook ads ☆37 · Updated last year
- Executable Examples for Making Data Visual ☆46 · Updated 6 years ago
- a convenient way to anonymize your data for analytics ☆20 · Updated 3 years ago
- Good Practice Tables - an XlsxWriter wrapper to write consistently formatted statistical tables to Excel. ☆37 · Updated last week
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆21 · Updated 3 years ago
- ☆71 · Updated this week
- Data validation as a service. Project retired, go to the current one at frictionsless/repository ☆69 · Updated last year
- A python package to create a database on the platform using our moj data warehousing framework ☆21 · Updated 2 months ago
- A tool to automatically infer columns data types in .csv files ☆34 · Updated last year
- This repository contains examples of how to use dbt's metric functionality on the jaffle shop dataset ☆28 · Updated 2 weeks ago