This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, components, and applications for real-time data analysis.
☆43Sep 26, 2024Updated last year
Alternatives and similar repositories for Real-Time-PySpark
Users that are interested in Real-Time-PySpark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- ☆24Sep 21, 2020Updated 5 years ago
- pyspark dataframe made easy☆16Dec 15, 2021Updated 4 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆12Feb 16, 2021Updated 5 years ago
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.☆13Jun 6, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Simple project using pyflink, kafka and postgre containerized using Docker☆11Aug 26, 2024Updated last year
- capstone project for Dataengineer.io bootcamp Public Repo☆12Feb 20, 2024Updated 2 years ago
- ☆12Apr 14, 2024Updated last year
- Implementing best practices for PySpark ETL jobs and applications.☆2,088Jan 1, 2023Updated 3 years ago
- A bot that sends a specific message to all of a user's friends.☆15Aug 26, 2021Updated 4 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)☆16Sep 20, 2023Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow or…☆20Aug 21, 2025Updated 7 months ago
- ☆41May 4, 2025Updated 11 months ago
- ☆17May 26, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- ☆21Oct 21, 2024Updated last year
- A repository for Analysis of Toronto Neighbourhoods (An IBM Data Science Capstone Project)☆10Jan 15, 2021Updated 5 years ago
- Starter application demonstrating how to connect a NestJS API to a PlanetScale MySQL database☆11Apr 12, 2023Updated 2 years ago
- ☆70Mar 1, 2026Updated last month
- ☆21Jun 7, 2024Updated last year
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 4 years ago
- TTS utility☆12Aug 2, 2020Updated 5 years ago
- Generate OpenAPI 3.x.x using Pydantic☆11Feb 9, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Apr 28, 2020Updated 5 years ago
- ☆10Feb 12, 2026Updated last month
- ☆37Apr 25, 2025Updated 11 months ago
- Docker Images with Databricks Connect Ready to go☆24Dec 26, 2023Updated 2 years ago
- Java Replica of the original Flappy Bird for mobile☆24Nov 25, 2023Updated 2 years ago
- AWS ETL Pipleine☆29May 16, 2024Updated last year
- (Python, PySpark)☆11Nov 15, 2020Updated 5 years ago
- A walkthorugh and tutorial covering all common techniques used for face detection☆19Jul 12, 2024Updated last year
- Unsupervised concept extraction from clinical text☆14Jun 17, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Notebooks corresponding to The Almost Astrophysicist's YouTube Channel. [https://www.youtube.com/thealmostastrophysicist]☆21Jul 30, 2023Updated 2 years ago
- Opensource repository to create html reports using the same structure as creating a streamlit dashboard☆15Mar 4, 2026Updated last month
- End-to-end data engineering pipeline with various technologies to ingest real time data.☆25Nov 3, 2023Updated 2 years ago
- Haraka SMTP plugin for logging outbound traffic. Useful for storing audit information of delivered/bounced emails.☆16Jan 12, 2023Updated 3 years ago
- code snippet for analytics sessions☆34May 17, 2022Updated 3 years ago
- 🚀 A simple javascript template for rapid development of GitHub actions.☆17Feb 24, 2023Updated 3 years ago
- StockStream is a web application developed using Streamlit, designed to provide users with real-time stock price data, stock price predic…☆21Oct 25, 2024Updated last year