Raiffeisen-DGTL / checkita-data-qualityLinks
Fast data quality framework for modern data infrastructure
☆29Updated 10 months ago
Alternatives and similar repositories for checkita-data-quality
Users that are interested in checkita-data-quality are comparing it to the libraries listed below
Sorting:
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Updated last year
- Distributed run of dbt models using Airflow☆167Updated 2 weeks ago
- Docker Compose with Almond.sh core for Jupyter☆18Updated last year
- Learning resources for Airflow Tutorial article.☆56Updated 5 years ago
- One ETL tool to rule them all☆84Updated this week
- Python client for MLflow REST API☆36Updated last year
- ☆22Updated 11 months ago
- ☆29Updated 3 years ago
- Numerical linear algebra course for Ozon Masters program☆14Updated 3 years ago
- This project is used to capture machine learning pipelines created on top of Spark as OK☆54Updated 3 years ago
- Utilities for monitoring and interacting with Jupyter Notebooks☆38Updated last month
- ☆398Updated last year
- Module for pipelines concept in PySpark☆16Updated last year
- ☆12Updated 4 years ago
- YTsaurus SPYT provides an integration with Apache Spark☆19Updated this week
- Practice course on Big Data☆17Updated last year
- Ambrosia is a Python library for A/B tests design, split and result measurement☆236Updated 2 years ago
- Allow parsing Russian receipts☆53Updated 5 years ago
- Курс про Apache Airflow 2.0☆36Updated 3 months ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆161Updated last month
- ☆43Updated 4 years ago
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Updated 9 months ago
- Cl app / pre-commit hook to clean Jupyter Notebooks metadata, execution_count and optionally output.☆11Updated 9 months ago
- Курс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)☆321Updated 3 years ago
- Data Engineering misc☆14Updated 4 years ago
- Spark Cluster with 4 executors☆17Updated 3 months ago
- Репозиторий для открытого курса «Промышленная эксплуатация моделей машинного обучения»☆95Updated last year
- Home assignments for data science positions☆632Updated last year
- A database-like benchmark of feature generation from time-series data☆13Updated last year
- 100 упражнений по NumPy (версия на русском)☆165Updated last year