A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.
☆25Aug 30, 2022Updated 3 years ago
Alternatives and similar repositories for GreatEx
Users that are interested in GreatEx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23May 14, 2022Updated 3 years ago
- End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)☆10May 26, 2023Updated 2 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- ✨🎨 Dark theme for Visual Studio code based on Aura theme with the "spirit" of dracula☆18Aug 28, 2022Updated 3 years ago
- Data Vault 2.0: Code generation, Vertica, Airflow☆13Nov 20, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆26Jul 9, 2023Updated 2 years ago
- ☆16Feb 17, 2020Updated 6 years ago
- ☆13Sep 5, 2025Updated 7 months ago
- Where the Meltano team runs Meltano! Get it???☆31Apr 9, 2025Updated last year
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆24Nov 30, 2020Updated 5 years ago
- Simple finite-state machines in Python☆38Apr 26, 2012Updated 13 years ago
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon …☆18Aug 25, 2021Updated 4 years ago
- Rust parser for Clickhouse SQL dialect.☆24Feb 16, 2022Updated 4 years ago
- Материалы курса Airflow 101☆15Jun 15, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Ingress data from kafka topic into clickhouse table (JSON format)☆24Apr 12, 2018Updated 8 years ago
- ☆16Dec 14, 2021Updated 4 years ago
- ☆10Mar 8, 2022Updated 4 years ago
- ☆56Jul 30, 2025Updated 8 months ago
- dbt + Trino demo project, using TPC-H sample data☆19Mar 27, 2024Updated 2 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆26Jul 19, 2017Updated 8 years ago
- Lee Clarín, La Nación y Olé sin registrarte☆13Jun 21, 2019Updated 6 years ago
- 📕 Writing tests, the DataMade way☆16Sep 24, 2020Updated 5 years ago
- Supercharged pandas indexing☆11Mar 28, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Telegram bot for automatic trading on the Tinkoff stock market☆21Apr 26, 2023Updated 2 years ago
- ETL jobs that DoltHub maintained that load public data into DoltHub.☆20Mar 7, 2023Updated 3 years ago
- CSS & HTML on Python Easily☆11Sep 23, 2024Updated last year
- Netrics - Active Measurements of Internet Performance☆12Sep 14, 2023Updated 2 years ago
- Deployment example for a scikit-learn/lightgbm pipeline☆10Feb 28, 2021Updated 5 years ago
- ☆14Mar 7, 2015Updated 11 years ago
- ☆12Oct 31, 2023Updated 2 years ago
- ☆12Jul 27, 2015Updated 10 years ago
- A small data lake meant for solitary use☆16Jan 28, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Marimekko and bar mekko graphics in R☆10Jun 7, 2025Updated 10 months ago
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆25Dec 19, 2022Updated 3 years ago
- Explore Chicago ticket data.☆10Dec 8, 2022Updated 3 years ago
- Simple web code editor build with web components libraries☆15Oct 12, 2023Updated 2 years ago
- Code and Word2Vec embeddings of LOINC codes for KDD 2019 DSHealth paper "Evaluation of Embeddings of Laboratory Test Codes for Patients a…☆11Jun 13, 2024Updated last year
- Remark plugin for selecting and storing code blocks from markdown.☆18Dec 7, 2022Updated 3 years ago
- Coupling PySpark with PyTorch Models☆14Jan 22, 2020Updated 6 years ago