An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit
☆20Aug 5, 2022Updated 3 years ago
Alternatives and similar repositories for reddit-data-engineering
Users that are interested in reddit-data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆36Jun 3, 2023Updated 2 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Apr 4, 2022Updated 4 years ago
- streaming eight subreddits from reddit api using kafka producer & spark structured streaming.☆19Apr 5, 2026Updated last week
- ☆10May 3, 2021Updated 4 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.