MrPowers / gill
An example PySpark project with pytest
☆17Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for gill
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Real-world Spark pipelines examples☆83Updated 6 years ago
- Make your libraries magically appear in Databricks.☆47Updated last year
- Test suite to document the behavior of Spark☆21Updated 3 years ago
- Data validation library for PySpark 3.0.0☆34Updated 2 years ago
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago
- These are some code examples☆55Updated 4 years ago
- type-class based data cleansing library for Apache Spark SQL☆79Updated 5 years ago
- The iterative broadcast join example code.☆69Updated 7 years ago
- Mastering Spark for Data Science, published by Packt☆46Updated last year
- ☆63Updated 5 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 3 years ago
- Utilities for writing tests that use Apache Spark.☆24Updated 5 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆56Updated last year
- Magic to help Spark pipelines upgrade☆34Updated last month
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 8 years ago
- Spark to Tableau Extractor library☆18Updated 7 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Updated 4 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆15Updated 9 months ago
- JSON schema parser for Apache Spark☆81Updated 2 years ago
- Repository used for Spark Trainings☆53Updated last year
- Examples To Help You Learn Apache Spark☆78Updated 6 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- A tool to validate data, built around Apache Spark.☆101Updated this week
- Examples for High Performance Spark☆15Updated 2 weeks ago
- ☆13Updated last week