DFoly / User_log_pipeline
Creating a Streaming Pipeline for user log data in Google Cloud Platform
☆22 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for User_log_pipeline
- Projects from Udacity Data Streaming Nanodegree ☆15 · Updated last year
- This repository is used as source code for the Medium post about implementing a Twitter recommender system using GCP. ☆31 · Updated 6 years ago
- A simple introduction to using Spark ML pipelines ☆26 · Updated 6 years ago
- Slides, code and more for my class: Data Analytics and Machine Learning on Big Data ☆8 · Updated 6 years ago
- This repo demonstrates how to load a sample Parquet-formatted file from an AWS S3 bucket. A Python job will then be submitted to a Apach… ☆19 · Updated 8 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag ☆29 · Updated 8 years ago
- AWS Big Data Certification ☆25 · Updated last year
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark ☆27 · Updated 6 years ago
- A code-based tutorial for production-level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 · Updated 5 years ago
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js ☆49 · Updated last year
- Collection of presentations of my work on various platforms and meetups ☆22 · Updated 5 years ago
- Sample Notebooks for PipelineAI ☆44 · Updated 2 years ago
- Example custom model image trainable and distributable via AWS SageMaker ☆36 · Updated last year
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 · Updated 4 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 · Updated 11 months ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python ☆83 · Updated 5 years ago
- Apache Spark Interview Questions and Answers ☆21 · Updated 4 years ago
- Udacity Data Pipeline Exercises ☆15 · Updated 4 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 · Updated 7 years ago
- Follow the Lumiata Tech Blog on Medium! ☆21 · Updated last year
- Notebooks for nlp-on-spark ☆13 · Updated 7 years ago
- A project template for developing BYOD Docker images for use in Amazon SageMaker. ☆19 · Updated 4 years ago
- Build an end-to-end machine learning pipeline to predict accessibility of playgrounds in NYC ☆14 · Updated 4 years ago
- Real-time social media data analytics with Apache Spark, Python, Kafka, Pandas, etc. ☆52 · Updated 8 years ago
- An example PySpark project with pytest ☆17 · Updated 7 years ago