rafaelleinio / thoth
Python tool for profiling-based anomaly monitoring on ETL data pipelines leveraging ML and Apache Spark.
☆15Updated 8 months ago
Related projects
Alternatives and complementary repositories for thoth
- A tool for building feature stores.☆283Updated last month
- Airflow Deployment on AWS ECS Fargate Using Cloudformation☆205Updated 2 years ago
- ☆20Updated 3 years ago
- Football scouts from Cartola FC at a data lake with data warehouse and dashboard☆15Updated 2 years ago
- Creates a Simulation of Fake Web Events☆80Updated 2 years ago
- ☆33Updated 3 years ago
- A data engineering personal project for applying some of my skills☆19Updated 3 years ago
- PySpark test helper methods with beautiful error messages☆622Updated last month
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- Great Expectations Airflow operator☆159Updated 3 weeks ago
- ☆43Updated 2 years ago
- ☆15Updated 7 months ago
- Food for thoughts around data contracts☆24Updated last week
- Python API for Deequ☆733Updated last month
- ☆45Updated 3 years ago
- Creating Lambda Functions to Ingest Data from APIs with AWS CDK☆13Updated 2 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated 2 years ago
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Updated 2 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆644Updated last month
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆196Updated last week
- A Python Library to support running data quality rules while the spark job is running⚡☆163Updated 2 weeks ago
- ☆58Updated 8 months ago
- Standalone Apache Spark installer for Linux systems (Debian- and RHEL-based)☆13Updated last year
- Collection of `pre-commit` hooks to ensure the quality of your `dbt` projects.☆47Updated last year
- The athena adapter plugin for dbt (https://getdbt.com)☆141Updated last year
- Dremio SDK for JavaScript☆27Updated 4 years ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆139Updated last week
- ML made simple☆207Updated last year