peerside / awesome-data-wranglingLinks
A curated list of data wrangling resources
☆38Updated 6 years ago
Alternatives and similar repositories for awesome-data-wrangling
Users that are interested in awesome-data-wrangling are comparing it to the libraries listed below
Sorting:
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆59Updated 2 weeks ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆33Updated 3 years ago
- Centralized whale instance using github actions, sourcing metadata from bigquery-public-data.☆17Updated last year
- The classic desktop version of osDQ☆10Updated 2 years ago
- A visual data pipeline builder with various backends☆103Updated this week
- Awesome Business Intelligence☆30Updated 8 months ago
- ☆39Updated 4 months ago
- CLI for creating databases for Data Quality Dashboards.☆19Updated 5 years ago
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆27Updated 3 weeks ago
- A monorepo of many Rill example projects☆39Updated this week
- portable Python ML-powered data bot☆23Updated 9 months ago
- A starter dbt project and synthetic claims dataset for trying out the Tuva Project.☆25Updated this week
- Generic interface exchange format for Data Warehouse Automation and ETL generation.☆41Updated 11 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 3 years ago
- Configuration and schema sync for Metabase from Python☆19Updated 2 years ago
- ☆74Updated 4 months ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated last year
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is …☆36Updated 4 months ago
- A curated list of dagster code snippets for data engineers☆55Updated last year
- A modern, enterprise-ready business intelligence web application. Unleash the value of your data. 📈 📉 📊☆33Updated 2 years ago
- This repository contains examples of how to use dbt's metric functionality on the jaffle shop dataset☆29Updated 6 months ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆48Updated 3 years ago
- Pro Data Tools 🛠️ for VS Code IDE 🧙♂️: DuckDB Pro Tools, PRQL Code Lens, new Markdown SQL Pro Tools, upcoming Data Notebooks 📚 Pro To…☆32Updated 11 months ago
- Data Tools Subjective List☆83Updated last year
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆18Updated 11 months ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆17Updated last month
- ☆13Updated 2 months ago