A real-time reddit data streaming pipeline for sentiment analysis of various subreddits
☆146Aug 23, 2023Updated 2 years ago
Alternatives and similar repositories for reddit-streaming-pipeline
Users that are interested in reddit-streaming-pipeline are comparing it to the libraries listed below.
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 9 months ago
- ☆30Feb 11, 2024Updated 2 years ago
- ☆43Sep 20, 2023Updated 2 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆267Jan 1, 2023Updated 3 years ago
- My first attempt at a rough ETL pipeline; technologies include Spark, GCS, Prefect orchestration, and Terraform☆14Oct 12, 2022Updated 3 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25May 6, 2023Updated 3 years ago
- A data engineering project with Airflow, dbt, Terraform, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- Code Repository for my 1st Data Project.☆25Mar 31, 2023Updated 3 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆434Nov 28, 2023Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- ☆44Mar 9, 2025Updated last year
- End to end data engineering project☆59Oct 27, 2022Updated 3 years ago
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 3 years ago
- Snippets for data set generation and analyses with ParlGov · 🗳️🧑🏻💻📊☆15Dec 5, 2024Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- Optimal Data Engine (ODE) for MSSQL☆14Dec 18, 2018Updated 7 years ago
- NoSQL extract, transform, load (ETL) toolkit with Python☆16May 9, 2026Updated 2 weeks ago
- End-to-end ELT data engineering project☆23Dec 24, 2022Updated 3 years ago
- Sample project to demonstrate data engineering best practices☆219Feb 24, 2024Updated 2 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆877Apr 16, 2022Updated 4 years ago
- Pull reddit data from APIs and store it in local db☆13Aug 9, 2025Updated 9 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆78Sep 2, 2023Updated 2 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Oct 20, 2022Updated 3 years ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.☆23Oct 31, 2024Updated last year
- In this project I used Apache Airflow to scrape a website periodically. This is for the tutorials I do on YouTube.☆10Nov 21, 2022Updated 3 years ago
- Collection of personal SQL projects and queries I've worked on, showcasing my skills and expertise in database management, data analysis,…☆42Aug 18, 2023Updated 2 years ago
- ☆394Jan 26, 2025Updated last year
- An end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆18Mar 31, 2024Updated 2 years ago
- ☆32Jan 17, 2025Updated last year
- All the work I am doing while learning to become a Data Engineer. Most of the projects are based on the tasks from the 'Career Track: Data E…☆12Sep 28, 2023Updated 2 years ago
- ☆16Apr 9, 2019Updated 7 years ago
- Spark Structured Streaming data pipeline that processes movie ratings data in real-time.☆14Apr 15, 2026Updated last month
- Portfolio of projects and studies conducted in data engineering.☆34Feb 22, 2025Updated last year
- ☆146Jan 31, 2023Updated 3 years ago
- ☆15Mar 15, 2024Updated 2 years ago