ecloudvalley / Building-a-Data-Lake-with-AWS-Glue-and-Amazon-S3Links
☆17Updated 7 years ago
Alternatives and similar repositories for Building-a-Data-Lake-with-AWS-Glue-and-Amazon-S3
Users that are interested in Building-a-Data-Lake-with-AWS-Glue-and-Amazon-S3 are comparing it to the libraries listed below
Sorting:
- AWS Big Data Certification☆25Updated 10 months ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆89Updated 6 years ago
- [Video]AWS Certified Machine Learning-Specialty (ML-S) Guide☆121Updated 10 months ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- Spark and Python (PySpark) Examples☆39Updated 4 years ago
- [Book-2019] Pragmatic AI: An Introduction to Cloud-based Machine Learning☆138Updated 10 months ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 6 years ago
- Code to build a simple analytics data pipeline with Python☆101Updated 8 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated 2 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Udacity Data Pipeline Exercises☆15Updated 5 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 3 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 7 years ago
- Open innovation with 60 minute cloud experiments on AWS☆87Updated last year
- A repo to track data engineering projects☆13Updated 3 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆28Updated 5 years ago
- Big Data Demystified meetup and blog examples☆31Updated last year
- 🐍💨 Airflow tutorial for PyCon 2019☆87Updated 2 years ago
- Sharing interesting and noteworthy Data Engineering content☆69Updated 9 years ago
- As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i…☆63Updated 6 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl☆71Updated 2 years ago
- This workshop demonstrates two methods of machine learning inference for global production using AWS Lambda and Amazon SageMaker☆58Updated 5 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, …☆84Updated 2 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- This repository shows a sample example to build, manage and orchestrate Machine Learning workflows using Amazon Sagemaker and Apache Airf…☆138Updated 4 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 9 years ago
- Managed Machine Learning Systems and Internet of Things Live Lesson☆40Updated 10 months ago
- ☆17Updated last year
- A Personalized 'Shop-by-Style' Experience via PyTorch on Amazon SageMaker and Amazon Neptune☆24Updated 4 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 10 months ago