hiejulia / Data-pipeline-projectLinks
Data pipeline project
☆35Updated 4 months ago
Alternatives and similar repositories for Data-pipeline-project
Users that are interested in Data-pipeline-project are comparing it to the libraries listed below
Sorting:
- This repository implements a real-time credit card fraud detection pipeline using Kafka, Spark and Cassandra. Kafka continuously produces…☆20Updated 4 years ago
- Big data projects implemented by Maniram yadav☆51Updated 7 years ago
- All my projects on Big Data are provided☆27Updated 8 years ago
- Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.☆48Updated last year
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.☆96Updated 4 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated 2 years ago
- Vietnam stock price crawling☆19Updated 2 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆11Updated 2 years ago
- PySpark-ETL☆23Updated 5 years ago
- This repo contains Big Data Project, its about "Real Time Twitter Sentiment Analysis via Kafka, Spark Streaming, MongoDB and Django Dashb…☆25Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆45Updated last year
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆40Updated 6 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆40Updated 4 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Updated last year
- Tutorial for Deployment of Machine Learning Model using Flask☆23Updated 4 years ago
- Counting Tweets Per User in Real-Time☆42Updated 7 years ago
- ☆18Updated 7 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 3 years ago
- Apache Spark using SQL☆14Updated 3 years ago
- This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data en…☆17Updated 2 weeks ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆71Updated 8 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Zomato Restaurants Exploratory Data Analysis, Visualization and Prediction with Sentiment Analysis of Reviews and Recommendation System☆75Updated 4 years ago
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3☆30Updated 4 years ago
- Project for real-time anomaly detection using Kafka and python☆57Updated 2 years ago
- data engineering 100 days 🤖 🧲 🦾 | #DE☆39Updated last year
- My Graduate Capstone Project - This is a Product Recommendation System for a Local Wholesaler in India, using Python and Machine Learning…☆28Updated 4 years ago