Spark data pipeline that processes movie ratings data.
☆31Mar 1, 2026Updated last week
Alternatives and similar repositories for spark-movies-etl
Users who are interested in spark-movies-etl are comparing it to the libraries listed below
- Spark Structured Streaming data pipeline that processes movie ratings data in real-time.☆13Mar 1, 2026Updated last week
- Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed Airflow: extrac…☆10Jul 12, 2021Updated 4 years ago
- ☆18Aug 6, 2024Updated last year
- An end-to-end real-time stock market data pipeline with Python, AWS EC2, Apache Kafka, and Cassandra. Data is processed on AWS EC2 with Apa…☆28Jun 7, 2023Updated 2 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆29Aug 8, 2020Updated 5 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- Code for YouTube channel☆10Apr 15, 2022Updated 3 years ago
- Climsoft Desktop for Windows - http://www.climsoft.org☆16Dec 19, 2025Updated 2 months ago
- EOSIO-Taurus - The Most Powerful Infrastructure for Decentralized Applications☆13Mar 29, 2024Updated last year
- Airflow DAGs for the Stellar ETL project☆38Updated this week
- ☆10May 16, 2022Updated 3 years ago
- ☆49Oct 15, 2024Updated last year
- Hands-On SQL Server 2019 Analysis Services, published by Packt.☆11Mar 2, 2026Updated last week
- AAIF landscape☆33Jan 15, 2026Updated last month
- ☆10Feb 12, 2026Updated 3 weeks ago
- Coupon System project: SpringBoot & AngularTS☆12Jan 3, 2021Updated 5 years ago
- Convert asciinema JSON files to GIF for embedding in GitHub, Medium, email, Slack and more!☆11Sep 24, 2020Updated 5 years ago
- ☆10Apr 13, 2022Updated 3 years ago
- A collection of Python utility functions☆11Feb 11, 2026Updated last month
- A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a role☆10Jan 9, 2026Updated 2 months ago
- Set up an async pipeline in Python using Celery, RabbitMQ and MongoDB. This repo covers the end-to-end deployment of an async pipeline fo…☆13Sep 23, 2022Updated 3 years ago
- Python library for continual lifelong anomaly detection☆20Dec 3, 2025Updated 3 months ago
- Transform AWS Config snapshots to a more AWS Athena-friendly format.☆11Aug 26, 2020Updated 5 years ago
- ☆10Dec 23, 2023Updated 2 years ago
- Let's run Ambari using docker compose. (feat. FreeIPA)☆10Nov 24, 2024Updated last year
- ☆10Apr 5, 2019Updated 6 years ago
- Corda Enterprise Network Manager (CENM) deployment☆10Nov 21, 2025Updated 3 months ago
- There isn't an official UI for the Q CLI, so I vibe coded one.☆19Mar 3, 2026Updated last week
- ☆10Feb 18, 2021Updated 5 years ago
- ☆17Jul 18, 2014Updated 11 years ago
- ☆10Apr 15, 2023Updated 2 years ago
- A place for modules.tf questions and issues☆13Oct 27, 2021Updated 4 years ago
- Spinning Solar System using pure CSS☆10Oct 2, 2022Updated 3 years ago
- 🐍 Run Object Detection Inferences in Python☆12Feb 3, 2020Updated 6 years ago
- Desktop application for managing dairy farms, built using Java, JavaFX & MySQL☆12Jan 8, 2026Updated 2 months ago
- Data Engineering and Data Analysis, as a new hire being tested by the boss, using SQL databases.☆15Aug 13, 2023Updated 2 years ago
- This repo contains ExecutionHook CRDs for dynamically executing user’s commands in pods/containers and an ExecutionHookController to mana…☆13Oct 7, 2022Updated 3 years ago
- Cool DE Projects☆66Updated this week
- Code samples for an Ignite conference presentation on the topic of Automating Azure SQL Data Warehouse☆11Mar 21, 2023Updated 2 years ago