This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, components, and applications for real-time data analysis.
☆43Sep 26, 2024Updated last year
Alternatives and similar repositories for Real-Time-PySpark
Users that are interested in Real-Time-PySpark are comparing it to the libraries listed below
Sorting:
- This repo consists of all important concepts for data engineers.☆11Dec 24, 2024Updated last year
- pyspark dataframe made easy☆16Dec 15, 2021Updated 4 years ago
- This repository contains code and configuration files for an Extract, Transform, Load (ETL) project using Google Cloud Data Fusion for da…☆20Feb 23, 2024Updated 2 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆12Feb 16, 2021Updated 5 years ago
- Codes related to data wrangling☆12Apr 12, 2020Updated 5 years ago
- Implementing best practices for PySpark ETL jobs and applications.☆2,085Jan 1, 2023Updated 3 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)☆16Sep 20, 2023Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow or…☆19Aug 21, 2025Updated 7 months ago
- The Free AWS Certified Cloud Practitioner Study Course☆14Oct 15, 2019Updated 6 years ago
- ☆17May 26, 2023Updated 2 years ago
- The Ultimate Guide to Snowpark, published by Packt☆16Jun 8, 2024Updated last year
- MCP server that provides hourly weather forecasts using the AccuWeather API☆30Jan 1, 2025Updated last year
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- ☆21Oct 21, 2024Updated last year
- This is the first project where we worked on apache spark, In this project what we have done is that we downloaded the datasets from KAGG…☆22Oct 14, 2021Updated 4 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆352Apr 24, 2024Updated last year
- Starter application demonstrating how to connect a NestJS API to a PlanetScale MySQL database☆11Apr 12, 2023Updated 2 years ago
- ☆146Jan 31, 2023Updated 3 years ago
- ☆70Mar 1, 2026Updated 2 weeks ago
- It is a assemble to include all Practice Projects about Big Data Topic, includes Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- ☆21Jun 7, 2024Updated last year
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 4 years ago
- TTS utility☆12Aug 2, 2020Updated 5 years ago
- An example integration between Flask and the Preact front end library.☆13Jun 20, 2022Updated 3 years ago
- ☆10Feb 12, 2026Updated last month
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- ☆36Apr 25, 2025Updated 10 months ago
- Recommend products or brands to users based on browsing history data☆13Dec 18, 2020Updated 5 years ago
- AWS ETL Pipleine☆29May 16, 2024Updated last year
- Java Replica of the original Flappy Bird for mobile☆24Nov 25, 2023Updated 2 years ago
- Unsupervised concept extraction from clinical text☆14Jun 17, 2024Updated last year
- ☆11Sep 6, 2019Updated 6 years ago
- ☆21Mar 31, 2024Updated last year
- Opensource repository to create html reports using the same structure as creating a streamlit dashboard☆15Mar 4, 2026Updated 2 weeks ago
- 🚀 A simple javascript template for rapid development of GitHub actions.☆17Feb 24, 2023Updated 3 years ago
- Haraka SMTP plugin for logging outbound traffic. Useful for storing audit information of delivered/bounced emails.☆16Jan 12, 2023Updated 3 years ago
- Code for the second edition of Data Pipelines with Apache Airflow Book☆43Feb 11, 2026Updated last month
- StockStream is a web application developed using Streamlit, designed to provide users with real-time stock price data, stock price predic…☆21Oct 25, 2024Updated last year
- Demo of structured, contextual JSON logging with Spring Boot and Log4j2☆15Feb 15, 2022Updated 4 years ago