Raiffeisen-DGTL / checkita-data-quality
Fast data quality framework for modern data infrastructure
☆29Updated 8 months ago
Alternatives and similar repositories for checkita-data-quality
Users who are interested in checkita-data-quality are comparing it to the libraries listed below
- Distributed run of dbt models using Airflow☆165Updated this week
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆57Updated last year
- Docker Compose with Almond.sh core for Jupyter☆18Updated last year
- One ETL tool to rule them all☆84Updated this week
- Learning resources for Airflow Tutorial article.☆56Updated 5 years ago
- Module for pipelines concept in PySpark☆15Updated last year
- A course on Apache Airflow 2.0☆36Updated last month
- ☆12Updated 4 years ago
- Data Engineering misc☆14Updated 4 years ago
- This project is used to capture machine learning pipelines created on top of Spark as OK☆53Updated 2 years ago
- ☆29Updated 3 years ago
- Analytics Engineer Course☆18Updated 2 years ago
- Practice course on Big Data☆15Updated last year
- The simple ETL with docker container☆59Updated 4 months ago
- Python client for MLflow REST API☆36Updated last year
- Data Engineer RoadMap☆35Updated 3 years ago
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Updated 7 months ago
- CLI app / pre-commit hook to clean Jupyter Notebooks metadata, execution_count and optionally output.☆11Updated 7 months ago
- An implementation of Pregel framework and graph algorithms on top of it with Ibis project DataFrames.☆23Updated 5 months ago
- Data catalog for everything in your company☆50Updated 2 years ago
- ☆13Updated 8 months ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆72Updated last week
- Allow parsing Russian receipts☆53Updated 5 years ago
- A database-like benchmark of feature generation from time-series data☆13Updated 10 months ago
- ☆16Updated 7 months ago
- Open episode of the data engineering practice course☆30Updated last year
- Ambrosia is a Python library for A/B tests design, split and result measurement☆235Updated last year
- ☆47Updated 4 years ago
- Python course☆39Updated 2 weeks ago
- Course "Applied Problems in Data Analysis" (CMC faculty, Lomonosov Moscow State University)☆318Updated 3 years ago