J-An-dev / real-time-fraud-detection
This repository implements a real-time credit card fraud detection pipeline using Kafka, Spark and Cassandra. Kafka continuously produces credit card transactions that will be analyzed by the Spark Streaming job in real-time. Meanwhile, classified transaction records will be displayed on the dashboard for visualization.
☆13Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for real-time-fraud-detection
- Data pipeline project☆23Updated last year
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆16Updated last year
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆41Updated 5 years ago
- ☆28Updated last year
- Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer.☆16Updated 2 years ago
- Udacity Data Engineering Nanodegree Program☆51Updated 3 years ago
- All my projects on Big Data are provided☆27Updated 7 years ago
- Vietnam stock price crawling☆17Updated last year
- Because its never late to start taking notes and 'public' it...☆60Updated this week
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆10Updated last year
- AWS Machine Learning-Specialty- my notes☆47Updated 5 years ago
- My Git Repo for Csv Data☆19Updated 4 years ago
- ☆18Updated 10 months ago
- Apache Spark using SQL☆14Updated 3 years ago
- This project is mainly for learning and practicing simple HIVE commands in real time scenarios. Here we have taken some sample coffee sho…☆11Updated 6 years ago
- ( These solutions tested on 4 node Hortonwork cluster on my laptop. Do not test on your production environment until you test... :)☆20Updated 4 years ago
- Repository related to Spark SQL and Pyspark using Python3☆36Updated 2 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆20Updated last year
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- ☆14Updated last year
- Customer 360 analytics powered by MapR☆23Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆94Updated last year
- Apache Spark Interview Question and Answers☆21Updated 4 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆21Updated 2 years ago
- PySpark-ETL☆23Updated 4 years ago
- ☆37Updated 4 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆157Updated 3 months ago
- Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collab…☆35Updated 4 years ago
- Simple ETL pipeline using Python☆21Updated last year
- An ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables☆12Updated 4 years ago