A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.
☆25Aug 30, 2022Updated 3 years ago
Alternatives and similar repositories for GreatEx
Users that are interested in GreatEx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GB: Построение хранилища данных и основы ETL☆10Mar 27, 2021Updated 5 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23May 14, 2022Updated 3 years ago
- Demo on how to use Prefect with Docker☆27Sep 8, 2022Updated 3 years ago
- End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)☆10May 26, 2023Updated 2 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An extended parametrizing plugin of pytest.☆18Aug 7, 2024Updated last year
- A decorator that sends alert when a Prefect flow fails☆15Apr 5, 2023Updated 3 years ago
- Data Vault 2.0: Code generation, Vertica, Airflow☆13Nov 20, 2019Updated 6 years ago
- ☆26Jul 9, 2023Updated 2 years ago
- Transformers for Multi-Label Text Classification☆11Sep 18, 2020Updated 5 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆24Nov 30, 2020Updated 5 years ago
- Python examples of Report Portal usage for different frameworks☆11Apr 14, 2026Updated 2 weeks ago
- Ingress data from kafka topic into clickhouse table (JSON format)☆24Apr 12, 2018Updated 8 years ago
- ☆16Dec 14, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FastAPI backend to upload files to S3☆26Jul 19, 2020Updated 5 years ago
- Prefect integrations for interacting with Great Expectations☆29Aug 15, 2024Updated last year
- ☆10Mar 8, 2022Updated 4 years ago
- ☆18Sep 24, 2024Updated last year
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆15Jan 4, 2026Updated 4 months ago
- A lightweight FastAPI scaffolding base to bootstrap App/API development utilizing MongoDB, Jinja2 Templates and no Javascript other than …☆13Nov 25, 2020Updated 5 years ago
- dbt + Trino demo project, using TPC-H sample data☆19Mar 27, 2024Updated 2 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆26Jul 19, 2017Updated 8 years ago
- ☆14Feb 26, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Aiogram template with mongo database☆16Aug 20, 2025Updated 8 months ago
- Para crear una imagen Docker de Shiny y subirla a Google Cloud sin morir en el intento.☆14Aug 28, 2022Updated 3 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18May 5, 2021Updated 4 years ago
- Demo project☆21Nov 26, 2021Updated 4 years ago
- E2E MLOps with Databricks☆16Nov 27, 2024Updated last year
- Parser and standardizer for politician, individual and organization names.☆128May 18, 2017Updated 8 years ago
- Supercharged pandas indexing☆11Mar 28, 2021Updated 5 years ago
- another express js mvc framework using mongo db (noSQL) and mvc design pattern to make restfull api's calls, there is also jwt to protect…☆16Nov 17, 2022Updated 3 years ago
- FastAPI Architecture☆19Oct 9, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Telegram bot for automatic trading on the Tinkoff stock market☆21Apr 26, 2023Updated 3 years ago
- ETL jobs that DoltHub maintained that load public data into DoltHub.☆20Mar 7, 2023Updated 3 years ago
- This is a memo to share what I have learnt in Apache Airflow☆21Oct 18, 2020Updated 5 years ago
- Accompanying solution accelerator notebook for the Databricks blog on transformer models☆15Sep 1, 2022Updated 3 years ago
- Library for working with FastAPI and MongoDB via Motor driver☆21Nov 19, 2021Updated 4 years ago
- ☆14Mar 7, 2015Updated 11 years ago
- Training and evaluating phase☆13Apr 1, 2021Updated 5 years ago