PacktPublishing / Data-Engineering-with-Scala-and-Spark
Data Engineering with Scala, published by Packt
☆25 Updated last year
Alternatives and similar repositories for Data-Engineering-with-Scala-and-Spark
Users who are interested in Data-Engineering-with-Scala-and-Spark are comparing it to the libraries listed below
- Data Engineering with Spark and Delta Lake ☆102 Updated 2 years ago
- ☆88 Updated 2 years ago
- Analytics engineering with dbt - projects and developer environment ☆19 Updated 10 months ago
- Data engineering with dbt, published by Packt ☆85 Updated last year
- Data Engineering with Google Cloud Platform, published by Packt ☆119 Updated last year
- Simple stream processing pipeline ☆103 Updated last year
- Code snippets for Data Engineering Design Patterns book ☆138 Updated 4 months ago
- Three-week Scaling Machine Learning course, in collaboration with O'Reilly, following the guidance of Adi Polak's book - Scaling Machi… ☆23 Updated 2 years ago
- Dockerizing an Apache Spark Standalone Cluster ☆43 Updated 3 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers ☆24 Updated 3 years ago
- Apache Airflow Best Practices, published by Packt ☆44 Updated 9 months ago
- Building Big Data Pipelines with Apache Beam, published by Packt ☆86 Updated 2 years ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt ☆21 Updated last year
- ☆90 Updated 6 months ago
- Data Engineering with AWS, 2nd edition - Published by Packt ☆150 Updated last year
- Data Engineering with Databricks Cookbook, published by Packt ☆94 Updated last year
- Code for my "Efficient Data Processing in SQL" book. ☆57 Updated last year
- ☆68 Updated 2 months ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian ☆219 Updated 2 years ago
- ☆186 Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up ☆55 Updated 4 years ago
- The Ultimate Hands-On Hadoop - Tame your Big Data!: https://www.udemy.com/the-ultimate-hands-on-hadoop-tame-your-big-data/ ☆8 Updated 6 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with the Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo… ☆82 Updated last year
- Building ETL Pipelines with Python ☆150 Updated last year
- Snowflake Cookbook, published by Packt ☆80 Updated 2 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt ☆45 Updated 2 years ago
- ☆16 Updated last year
- Azure Data Engineering Cookbook 2nd-edition, published by Packt ☆32 Updated last year
- ☆28 Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has a complete ETL pipeline for a data lake. SparkSession extensions, DataFrame validatio… ☆56 Updated 2 years ago