namebrandon / Sparkov_Data_GenerationLinks
Synthetic Credit Card Transaction Generator used in the Sparkov program.
☆151Updated 2 years ago
Alternatives and similar repositories for Sparkov_Data_Generation
Users that are interested in Sparkov_Data_Generation are comparing it to the libraries listed below
Sorting:
- Financial Simulator of Mobile Money Service☆110Updated 4 years ago
- AML End to End Example☆54Updated 2 years ago
- Template repo for kickstarting recipes for regression use case☆54Updated 5 months ago
- Reproducible Machine Learning for Credit Card Fraud Detection - Practical Handbook☆581Updated last year
- The data represents financial transactions -- bank transfers, purchases, credit card transactions, checks, etc. Most of the transactions…☆49Updated last year
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model☆63Updated 3 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆30Updated 4 years ago
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently.☆56Updated 2 years ago
- Example repo to kickstart integration with mlflow pipelines.☆76Updated 2 years ago
- Sample application running fbprophet using spark☆48Updated 6 years ago
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆550Updated 2 years ago
- Reference code base for ML Engineering, Manning Publications☆128Updated 3 years ago
- Materials of the Official Helm Chart Webinar☆27Updated 3 years ago
- Capturing model drift and handling its response - Example webinar☆108Updated 5 years ago
- ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation☆210Updated 7 months ago
- Streaming Anomaly Detection Solution by using Pub/Sub, Dataflow, BQML & Cloud DLP☆181Updated 4 months ago
- ☆108Updated 3 years ago
- A workshop with several modules to help learn Feast, an open-source feature store☆92Updated 4 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆463Updated 7 months ago
- Anomaly Detection Pipeline with Isolation Forest model and Kedro framework☆24Updated 2 years ago
- Demo of Streamlit application with Databricks SQL Endpoint☆35Updated 2 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆67Updated last month
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆38Updated 10 months ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆216Updated last year
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆68Updated last year
- Predict if a reservation will be canceled using robust Machine Learning pipelines with Airflow and Mlflow☆63Updated last year
- Delta Lake Documentation☆49Updated 11 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- ☆38Updated 2 years ago
- Machine Learning Engineering with MLflow, published by Packt☆115Updated 11 months ago