naveenkrsh / books
☆15Updated 8 years ago
Alternatives and similar repositories for books
Users that are interested in books are comparing it to the libraries listed below
- Apache Spark Structured Streaming with Kafka using Python (PySpark)☆40Updated 6 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆216Updated 2 years ago
- ☆60Updated 4 years ago
- This contains instructions for installing Hadoop on Google Colab and running MapReduce in Hadoop☆33Updated 4 years ago
- Interactive Notebooks that support the book☆40Updated 4 years ago
- ☆32Updated 7 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Updated 3 years ago
- This is my personal collection of free Hadoop books, please feel free to share and learn.☆17Updated 7 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆160Updated 10 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- ☆33Updated last year
- An ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables☆13Updated 5 years ago
- Apache Spark Course Material☆91Updated 2 years ago
- Complete PySpark Guide for the beginners... I prepared this notebook for my students.☆18Updated 5 years ago
- Accessed the Twitter API for live streaming tweets. Performed Feature Extraction and transformation from the JSON format of tweets using …☆19Updated 8 years ago
- Spark Examples☆125Updated 3 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆45Updated last year
- Maternal Health Risk prediction MLOps pipeline☆43Updated 2 years ago
- ☆59Updated 8 years ago
- I Am Just A Student : Let's study AI/ML together. #iamJustAStudent☆30Updated last year
- Flink Streaming SQL | FlinkCEP | Some demos and notes☆20Updated 4 years ago
- My documents for self-learning the fundamentals of data engineering☆12Updated last year
- My Git Repo for Csv Data☆21Updated 4 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree☆74Updated last year
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆16Updated 4 years ago
- Data Science Study Notes + Projects☆22Updated 3 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆158Updated 6 months ago
- Design/Implement stream/batch architecture on NYC taxi data | #DE☆25Updated 4 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆37Updated last year