How to evaluate the quality of your data with Great Expectations and Spark.
☆31 · Mar 29, 2023 · Updated 3 years ago
Alternatives and similar repositories for hands-on-great-expectations-with-spark
Users that are interested in hands-on-great-expectations-with-spark are comparing it to the libraries listed below.
- In-Session Personalization Workshop for eCommerce, April 2021, and the MICES Workshop in June 2021. ☆24 · Jun 29, 2021 · Updated 4 years ago
- Demo repository to lambda-fy your dbt runs ☆11 · Sep 7, 2023 · Updated 2 years ago
- Serve a 1x1 GIF pixel from an AWS lambda-powered endpoint ☆13 · Sep 7, 2017 · Updated 8 years ago
- A dbt package to run natural language queries ☆10 · Jan 13, 2023 · Updated 3 years ago
- 🏟 ☆28 · Nov 11, 2020 · Updated 5 years ago
- A write-audit-publish implementation on a data lake without the JVM ☆45 · Aug 12, 2024 · Updated last year
- Anki Overdrive API for Python ☆12 · Oct 21, 2017 · Updated 8 years ago
- ☆22 · Mar 31, 2022 · Updated 4 years ago
- ☆12 · Oct 25, 2023 · Updated 2 years ago
- Testing various methods of moving Arrow data between processes ☆16 · Mar 29, 2023 · Updated 3 years ago
- A Node.js tool to examine the correctness of Open Data Metadata and build custom dataset profiles ☆12 · Sep 26, 2023 · Updated 2 years ago
- An implementation of Defeasible Logic in Python ☆15 · Sep 2, 2018 · Updated 7 years ago
- reference implementations and use cases done with bauplan ☆62 · Mar 30, 2026 · Updated 2 weeks ago
- Python+node wrapper to read/send message from/to Anki Overdrive bluetooth vehicles. ☆18 · Aug 9, 2022 · Updated 3 years ago
- Learning resources for Airflow Tutorial article. ☆56 · Jul 22, 2020 · Updated 5 years ago
- A fast and simple JavaScript library specifically targeted at collecting search and search related browser events. ☆43 · Nov 20, 2025 · Updated 4 months ago
- A Python library for anomaly detection ☆13 · Aug 28, 2017 · Updated 8 years ago
- ☆41 · May 16, 2023 · Updated 2 years ago
- ☆22 · Jun 28, 2022 · Updated 3 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in … ☆25 · Aug 30, 2022 · Updated 3 years ago
- WebApp in RShiny using the package itunesr for iTunes AppStore Review Extraction and Analysis ☆10 · Mar 3, 2020 · Updated 6 years ago
- ☆13 · Feb 18, 2022 · Updated 4 years ago
- Functional Data Engineering tutorial in Python & Airflow. ☆17 · Mar 24, 2023 · Updated 3 years ago
- ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included ☆16 · Jan 26, 2026 · Updated 2 months ago
- ☆15 · May 7, 2025 · Updated 11 months ago
- Demonstration of how dedupe might be used as geocoder ☆17 · Jun 21, 2022 · Updated 3 years ago
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore. ☆17 · Apr 22, 2022 · Updated 3 years ago
- This example shows how to run Anychart library with the Scala programming language using Akka Http and MySQL. ☆11 · Dec 21, 2017 · Updated 8 years ago
- ☆12 · May 19, 2021 · Updated 4 years ago
- Visual Studio Code Server on Azure Web App for Containers ☆10 · Apr 12, 2019 · Updated 7 years ago
- Official Repository for EvalRS @ KDD 2023: a Rounded Evaluation of Recommender Systems ☆30 · Feb 16, 2024 · Updated 2 years ago
- PROVED (PRocess mining OVer uncErtain Data) is a library of functionalities to perform process mining on uncertain event data. ☆12 · Jan 12, 2023 · Updated 3 years ago
- Spark (PySpark) script that applies dynamic time warping to Energy usage data (using the python fastdtw package) ☆15 · Oct 22, 2016 · Updated 9 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆29 · Dec 7, 2021 · Updated 4 years ago
- Geocoding Australia Using the PSMA Data ☆14 · Mar 3, 2023 · Updated 3 years ago
- Old repo for R interface for GraphFrames ☆13 · Mar 21, 2018 · Updated 8 years ago
- ☆17 · Sep 12, 2020 · Updated 5 years ago
- ☆16 · Feb 12, 2025 · Updated last year
- ☆17 · Nov 7, 2024 · Updated last year