hiejulia / Data-pipeline-project
Data pipeline project
☆34Updated 3 months ago
Alternatives and similar repositories for Data-pipeline-project
Users that are interested in Data-pipeline-project are comparing it to the libraries listed below
Sorting:
- All my projects on Big Data are provided☆27Updated 8 years ago
- Big data projects implemented by Maniram yadav☆51Updated 7 years ago
- Counting Tweets Per User in Real-Time☆42Updated 7 years ago
- Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.☆96Updated 3 years ago
- Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.☆48Updated last year
- This repository implements a real-time credit card fraud detection pipeline using Kafka, Spark and Cassandra. Kafka continuously produces…☆19Updated 4 years ago
- ☆150Updated 7 years ago
- data engineering 100 days 🤖 🧲 🦾 | #DE☆40Updated last year
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆40Updated 6 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆11Updated last year
- ☆13Updated 2 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated 2 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Apache Spark Interview Question and Answers☆21Updated 4 years ago
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆40Updated 4 years ago
- Hadoop tutorial Files. For detailed Tutorials visit www.youtube.com/learningjournalin☆26Updated 7 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- ☆18Updated 7 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Updated last year
- Big Data Management and Analysis Final Project☆68Updated 7 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆70Updated 8 years ago
- Project - Data Processing and Analysis in Python Course☆41Updated 6 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆44Updated last year
- Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web …☆18Updated 3 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Updated 3 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 7 years ago