The source code for the book Modern Data Engineering with Apache Spark
☆39Jul 26, 2022Updated 3 years ago
Alternatives and similar repositories for spark-moderndataengineering
Users that are interested in spark-moderndataengineering are comparing it to the libraries listed below
Sorting:
- Don't Panic. This guide will help you when it feels like the end of the world.☆30Feb 7, 2026Updated 3 weeks ago
- A series of workshop modules introducing Feast feature store.☆19May 31, 2022Updated 3 years ago
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.☆25May 5, 2022Updated 3 years ago
- ☆14Feb 15, 2025Updated last year
- ☆11Oct 6, 2023Updated 2 years ago
- Boilerplate project for MOTW Workshop 2015☆10Mar 3, 2016Updated 10 years ago
- ☆11Aug 14, 2014Updated 11 years ago
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Dec 13, 2024Updated last year
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆94May 9, 2025Updated 9 months ago
- ☆14Apr 18, 2023Updated 2 years ago
- Source code for the module "Advanced Statistics" 📊☆10Feb 25, 2019Updated 7 years ago
- Limit long text output for a single JupyterLab mime render.☆13Jul 30, 2025Updated 7 months ago
- Real-time change point detection☆10Jan 14, 2017Updated 9 years ago
- A collaborative, real-time feature model editor☆10Nov 27, 2023Updated 2 years ago
- Docker Image - Tadpole DB Hub☆14Jul 28, 2021Updated 4 years ago
- An interactive Rust learning platform featuring progressive exercises aligned with "The Rust Programming Language" book.☆20Dec 8, 2025Updated 2 months ago
- docker image to deploy rabbitmq cluster on mesos with one marathon app☆10Oct 12, 2017Updated 8 years ago
- Spark in Action, 2nd edition - chapter 12 - Transforming your data☆11Feb 6, 2024Updated 2 years ago
- Nim GUI Library☆13Oct 19, 2021Updated 4 years ago
- Model Context Protocol (MCP) server to interact with gRPC services using the grpcurl tool☆16Mar 5, 2025Updated 11 months ago
- Simple key-value store backed by sqlite☆16Jan 13, 2025Updated last year
- ☆14Feb 23, 2021Updated 5 years ago
- Embedding SQLite in Redis, with a sane license (Apache)☆11Jun 30, 2021Updated 4 years ago
- ☆13Jul 1, 2025Updated 8 months ago
- A Chinese friendly zola theme. Inspired by lightspeed.☆12Feb 10, 2025Updated last year
- Useful extension utilities for thiserror.☆14Dec 4, 2025Updated 2 months ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- Spark in Action, 2nd edition - chapter 16 - performance, checkpointing, and caching☆12Apr 21, 2023Updated 2 years ago
- High Performance with Java, published by Packt☆15Jul 18, 2024Updated last year
- Digital Transformation and Modernization with IBM API Connect, published by Packt☆12Jan 30, 2023Updated 3 years ago
- Java code for Apache Nifi processors☆11Jun 5, 2017Updated 8 years ago
- (ARCHIVE) nqp-rx☆37Sep 25, 2014Updated 11 years ago
- ☆112Jan 15, 2025Updated last year
- tool to analyze performance of Raku programs running on moarvm☆12Jun 7, 2025Updated 8 months ago
- ☆11Oct 31, 2019Updated 6 years ago
- ☆16Oct 21, 2024Updated last year
- A Gentle introduction to Machine Learning with Apache Spark☆11Jun 27, 2025Updated 8 months ago
- Native libraries utilities for Perl6☆12Oct 14, 2023Updated 2 years ago