The source code for the book Modern Data Engineering with Apache Spark
☆40Jul 26, 2022Updated 3 years ago
Alternatives and similar repositories for spark-moderndataengineering
Users that are interested in spark-moderndataengineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- A series of workshop modules introducing Feast feature store.☆18May 31, 2022Updated 3 years ago
- Don't Panic. This guide will help you when it feels like the end of the world.☆30Feb 7, 2026Updated 3 months ago
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago
- Model Context Protocol (MCP) server to interact with gRPC services using the grpcurl tool☆17Mar 5, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- High Performance with Java, published by Packt☆15Jul 18, 2024Updated last year
- Code for the book "Get Programming with Scala" (Manning)☆83Feb 12, 2023Updated 3 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Nov 16, 2021Updated 4 years ago
- ☆21Aug 31, 2025Updated 8 months ago
- Resources for the book "Functional and Concurrent Programming"☆19Jan 16, 2026Updated 4 months ago
- Unity Catalog AI Model Context Protocol Server☆16Mar 28, 2025Updated last year
- Collection of code snippets for blogs, conferences, and talks☆24Nov 1, 2022Updated 3 years ago
- Docker Compose environments for developing modern data platform architectures using Kafka, Flink, Spark, Iceberg, OpenLineage, OpenMetada…☆54May 5, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Curso de análisis de textos con técnicas de aprendizaje automático☆17Nov 13, 2019Updated 6 years ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 3 years ago
- ☆55Jan 28, 2026Updated 3 months ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆96May 11, 2026Updated 2 weeks ago
- ☆16Oct 21, 2024Updated last year
- Code examples for functional programming☆21Mar 3, 2025Updated last year
- GitHub Repository for Azure AI-102 Essentials to Learn, Implement, and Certify☆35Feb 11, 2026Updated 3 months ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- ☆116Jan 15, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- ☆13Jul 1, 2025Updated 10 months ago
- Managing Data as a Product, published by Packt☆21Nov 30, 2024Updated last year
- A Flat Data GitHub Action demo repo☆15Jan 1, 2024Updated 2 years ago
- Rest API for Todobackend on top of Cassandra☆26Feb 22, 2023Updated 3 years ago
- Generate Parquet Files☆14Apr 23, 2026Updated last month
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.☆25May 5, 2022Updated 4 years ago
- ☆13Feb 19, 2025Updated last year
- Examples of using Neo4j with R.☆22Jan 18, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Oct 6, 2023Updated 2 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆29Apr 4, 2026Updated last month
- A Model Context Protocol server for Google Workspace integration (Gmail and Calendar)☆30Dec 29, 2024Updated last year
- This repo is for the Linkedin Learning course: Learning Neo4j☆26Jun 13, 2023Updated 2 years ago
- Mock streaming data generator☆18May 31, 2024Updated last year
- 3D model file for the 2-pin JST battery removal tool to remove single cell, LiPo battery's JST-PH connectors.☆12Jan 30, 2025Updated last year
- Spark in Action, 2nd edition - chapter 16 - performance, checkpointing, and caching☆12Apr 21, 2023Updated 3 years ago