baligoyem / dataqtorLinks
πYour Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it π‘ππ π
β16Updated 2 years ago
Alternatives and similar repositories for dataqtor
Users that are interested in dataqtor are comparing it to the libraries listed below
Sorting:
- DataHub on AWS demonstration resourcesβ10Updated 2 years ago
- Using the Parquet file format with Pythonβ15Updated last year
- β15Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable froβ¦β27Updated 3 years ago
- Library of Prefect tasks and utilities.β9Updated 9 months ago
- β29Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise itβ26Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clientsβ36Updated last year
- The sane way of building a data layer in Airflowβ24Updated 5 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.β58Updated 3 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runsβ20Updated 3 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"β18Updated 4 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.β11Updated 4 years ago
- A collection of python utility functionsβ11Updated last year
- A small Python module containing quick utility functions for standard ETL processes.β36Updated last week
- A few end to end examples that use data-describeβ16Updated 2 years ago
- Check the basic quality of any datasetβ10Updated 4 years ago
- Simple samples for writing ETL transform scripts in Pythonβ23Updated last week
- This repo contains the LookML for the model and dashboards used with the FHIR healthcare dataset to showcase how Looker can add value to β¦β12Updated 2 years ago
- This repository contains code to build an MVP search engine with google like interface.β15Updated last month
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdbβ21Updated last year
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.β37Updated 6 years ago
- SQL interface to Pandasβ52Updated 3 years ago
- bamboolib - template for creating your own binder notebookβ21Updated 3 years ago
- π Run, schedule, and manage your dbt jobs using Kubernetes.β24Updated 6 years ago
- CLI for data platformβ19Updated last year
- Small script for automating mkgendocs and mkdocs filesβ18Updated 3 years ago
- A software engineering framework to jump start your machine learning projectsβ37Updated last year
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.β15Updated 8 months ago
- A repository containing an introduction to Panel made to be support videos and talks.β56Updated 3 years ago