peerside / awesome-data-wrangling
A curated list of data wrangling resources
☆32Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-data-wrangling
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆26Updated 2 years ago
- A monorepo of many Rill example projects☆31Updated this week
- CLI for creating databases for Data Quality Dashboards.☆19Updated 5 years ago
- Centralized whale instance using github actions, sourcing metadata from bigquery-public-data.☆17Updated 4 months ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆140Updated last year
- BroadbandNow is the most comprehensive resource for internet service provider plan, pricing and coverage data.☆27Updated 3 years ago
- Shiny app to parse a .twb file to show data source as well as dependency between calculated fields, parameters and raw data.☆28Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆122Updated 3 years ago
- The classic desktop version of osDQ☆10Updated 2 years ago
- Configuration and schema sync for Metabase from Python☆19Updated last year
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is …☆34Updated 2 months ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆42Updated 2 years ago
- A visual data pipeline builder with various backends☆97Updated this week
- This data dictionary provides information about the tables and views in the "workgroup" PostgreSQL database of the Tableau Server reposit…☆38Updated 6 months ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 5 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated 8 months ago
- A small Python module containing quick utility functions for standard ETL processes.☆33Updated last week
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆46Updated 3 months ago
- ☆15Updated last year
- A modern, enterprise-ready business intelligence web application. Unleash the value of your data. 📈 📉 📊☆31Updated last year
- A library for data warehouse and data integration pattern and architecture documentation.☆49Updated 11 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆111Updated this week
- Sample configuration to deploy a modern data platform.☆86Updated 2 years ago
- python library for reading json-stat format dataset☆21Updated 3 years ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆71Updated 11 months ago
- A tool to generate PySpark schema from JSON.☆26Updated 9 months ago
- ☆32Updated 4 years ago
- SQL-based transforms compatible with Rasgo and PyRasgo☆24Updated 6 months ago
- Open-source metadata collector based on ODD Specification☆42Updated last year