nama1arpit / reddit-streaming-pipeline
A real-time Reddit data streaming pipeline for sentiment analysis of various subreddits
☆143Aug 23, 2023Updated 2 years ago
Alternatives and similar repositories for reddit-streaming-pipeline
Users that are interested in reddit-streaming-pipeline are comparing it to the libraries listed below
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 6 months ago
- An end-to-end workflow for processing streaming data on Azure.☆17Sep 20, 2024Updated last year
- ☆30Feb 11, 2024Updated 2 years ago
- ☆40Mar 9, 2025Updated 11 months ago
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…☆22Nov 19, 2024Updated last year
- A data engineering project with Airflow, dbt, Terraform, GCP and much more!☆25Nov 8, 2022Updated 3 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- Code Repository for my 1st Data Project.☆25Mar 31, 2023Updated 2 years ago
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c…☆15Oct 2, 2023Updated 2 years ago
- All the work I am doing while learning to become a Data Engineer. Most of the projects are based on the tasks from the 'Career Track: Data E…☆11Sep 28, 2023Updated 2 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆15Jan 4, 2026Updated last month
- Collection of quick starts on docker, terraform, ansible, etc☆18Apr 9, 2024Updated last year
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Feb 3, 2026Updated last week
- End-to-end ELT data engineering project☆22Dec 24, 2022Updated 3 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 2 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆21Jan 28, 2018Updated 8 years ago
- Collection of personal SQL projects and queries I've worked on, showcasing my skills and expertise in database management, data analysis,…☆32Aug 18, 2023Updated 2 years ago
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 2 years ago
- End-to-end data engineer project☆23Aug 17, 2023Updated 2 years ago
- A Spark Publish/Subscribe NATS Connector☆27Oct 12, 2020Updated 5 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- ☆13Feb 15, 2025Updated last year
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆25Nov 12, 2022Updated 3 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆28Nov 9, 2023Updated 2 years ago
- F1 Data Pipeline☆25Jul 1, 2023Updated 2 years ago
- Transaction processing & vis pipeline using PySpark Streaming☆30Jul 18, 2024Updated last year
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3☆32Feb 2, 2021Updated 5 years ago
- An end-to-end real-time stock market data pipeline with Python, AWS EC2, Apache Kafka, and Cassandra. Data is processed on AWS EC2 with Apa…☆27Jun 7, 2023Updated 2 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆837Apr 16, 2022Updated 3 years ago
- benchmarks for LLM tokenizers☆16Jan 15, 2026Updated last month
- ☆384Jan 26, 2025Updated last year
- Uses Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions to establish an end-to-end data pipeline☆29Oct 25, 2023Updated 2 years ago
- Example end to end data engineering project.☆1,384Dec 8, 2022Updated 3 years ago
- ☆82Feb 25, 2025Updated 11 months ago
- Portfolio of projects and studies conducted in data engineering.☆34Feb 22, 2025Updated 11 months ago