J-An-dev / real-time-fraud-detection
This repository implements a real-time credit card fraud detection pipeline using Kafka, Spark, and Cassandra. Kafka continuously produces credit card transactions, which the Spark Streaming job analyzes in real time. Classified transaction records are then displayed on a dashboard for visualization.
☆20 Updated 4 years ago
Alternatives and similar repositories for real-time-fraud-detection:
Users who are interested in real-time-fraud-detection are comparing it to the repositories listed below:
- ☆18 Updated 6 years ago
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti… ☆29 Updated last year
- A Big Data project leveraging AWS services and Apache frameworks to identify and visualize fraudulent credit card transaction patterns, p… ☆11 Updated last year
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D… ☆30 Updated 4 years ago
- Simple ETL pipeline using Python ☆25 Updated last year
- Data pipeline project ☆32 Updated last month
- Apache Spark using SQL ☆14 Updated 3 years ago
- ☆29 Updated 2 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree ☆74 Updated last year
- IBM Data Engineering Courses from Coursera ☆71 Updated last year
- All my projects on Big Data are provided ☆27 Updated 8 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple … ☆29 Updated 4 years ago
- ☆20 Updated 2 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide ☆16 Updated 4 years ago
- Company-wise data science interview questions, which include data analyst, junior data scientist, machine learning engineer, etc. pos… ☆15 Updated 2 years ago
- Deployment of an NLP sentiment analysis use case with the help of Flask and Docker. ☆8 Updated 7 months ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin… ☆15 Updated 3 years ago
- ☆40 Updated 8 months ago
- Content related to Mastering PostgreSQL, along with videos. ☆15 Updated 3 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt ☆158 Updated 7 months ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar… ☆43 Updated last year
- ☆87 Updated 2 years ago
- Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer. ☆17 Updated 2 years ago
- PySpark Cheatsheet ☆36 Updated 2 years ago
- Business challenge that requires building a data platform for retailer data analytics. ☆12 Updated 2 years ago
- Repository for Spark using Python material. It is popularly known as PySpark. ☆19 Updated 3 years ago
- AWS Machine Learning-Specialty- my notes ☆48 Updated 5 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling ☆100 Updated 4 years ago
- Udacity Data Engineering Nanodegree Program ☆52 Updated 4 years ago
- ☆27 Updated last year