aws-samples/amazon-eks-apache-spark-etl-sample

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aws-samples/amazon-eks-apache-spark-etl-sample)

aws-samples / amazon-eks-apache-spark-etl-sample

Spark ETL example processing New York taxi rides public dataset on EKS

☆45

Alternatives and similar repositories for amazon-eks-apache-spark-etl-sample

Users that are interested in amazon-eks-apache-spark-etl-sample are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aws-samples / amazon-emr-optimize-data-processing
View on GitHub
Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark
☆14Apr 14, 2023Updated 3 years ago
aws-samples / eks-manage-node-groups-placement-group
View on GitHub
EKS Managed Node Groups with Placement Group
☆12Jul 15, 2024Updated 2 years ago
aws-samples / amazon-es-check-cw-alarms
View on GitHub
This sample code checks, and optionally creates, recommended CloudWatch alarms for your Amazon Elasticsearch service domain.
☆13Feb 16, 2018Updated 8 years ago
sbt / sbt-cucumber
View on GitHub
Cucumber plugin for SBT.
☆19Oct 19, 2021Updated 4 years ago
infinispan / infinispan-helm-charts
View on GitHub
☆17Jun 5, 2026Updated last month
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
testdrivenio / vault-consul-swarm
View on GitHub
Deploy Vault and Consul with Docker Swarm
☆24Oct 2, 2021Updated 4 years ago
aws-samples / amazon-redshift-data-warehouse-migration
View on GitHub
In few hours, quickly learn how to effectively migrate oracle data warehouse workload to Amazon Redshift using AWS Schema Conversion Tool…
☆10Dec 16, 2020Updated 5 years ago
mayur2810 / sope
View on GitHub
Apache Spark ETL Utilities
☆40Oct 23, 2024Updated last year
dubeyrupesh / Gatling-Kinesis
View on GitHub
This project aims at doing performance testing of AWS Kinesis stream
☆11May 16, 2020Updated 6 years ago
aws-samples / amazon-kinesis-analytics-streaming-etl
View on GitHub
Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics
☆65Oct 17, 2023Updated 2 years ago
drabastomek / learningPySpark_video
View on GitHub
Learning PySpark video series
☆11Mar 5, 2018Updated 8 years ago
awslabs / amazon-emr-vscode-toolkit
View on GitHub
A VS Code Extension to make it easier to manage and develop Spark jobs on EMR
☆39Feb 17, 2025Updated last year
aws-samples / amazon-ecs-fluent-bit-daemon-service
View on GitHub
Fluent Bit plugin-based centralized log analysis across Amazon ECS & EKS clusters
☆55Oct 29, 2020Updated 5 years ago
A3Data / pyspark-notebook-helm
View on GitHub
This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.
☆17Nov 16, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mudassir0909 / stackoverflow-card
View on GitHub
Unofficial embeddable Stackoverflow profile summary card
☆11Nov 19, 2022Updated 3 years ago
aws-samples / amazon-kinesis-data-analytics-flinktableapi
View on GitHub
A walkthrough of setting up a Kinesis Data Analytics for Java Application which ingest streaming JSON data and leverages the Flink Table …
☆16Aug 30, 2023Updated 2 years ago
aws-samples / amazon-sagemaker-analyze-model-predictions
View on GitHub
☆25Jan 5, 2021Updated 5 years ago
DataChefHQ / aws-data-landing-zone
View on GitHub
The Data Landing Zone is a CDK Construct designed to create a landing zone tailored for supporting and enabling AI, data-driven, data mes…
☆23Updated this week
sunilsala88 / fyers-files-feb-2024
View on GitHub
☆12Mar 27, 2024Updated 2 years ago
santoshjoshi / Apache-Kafka
View on GitHub
Apache Kafka Overview
☆12Jun 9, 2023Updated 3 years ago
alexbeletsky / tdd.demand
View on GitHub
A Web Crawler that crawle some job looking sites, analyze them, store data
☆19Jun 5, 2012Updated 14 years ago
pierangeloc / ray-tracer-zio
View on GitHub
A ray tracer to learn ZIO modules
☆27Mar 19, 2021Updated 5 years ago
databricks-solutions / databricks-apps-examples
View on GitHub
Features Databricks Apps examples that are built by Databricks field personnel. Meant to act as points-of-inspiration and points-of-imple…
☆25Jan 19, 2026Updated 6 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
HsiehShuJeng / cdk-emrserverless-with-delta-lake
View on GitHub
This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…
☆11Nov 18, 2025Updated 8 months ago
AWSCookbook / BigData
View on GitHub
Chapter 7 of the AWS Cookbook
☆12Mar 23, 2022Updated 4 years ago
aws-samples / amazon-S3-cache-with-amazon-elasticache-redis
View on GitHub
This sample project illustrates how you can cache Amazon S3 objects within Amazon ElastiCache for Redis.
☆21Mar 26, 2019Updated 7 years ago
tsimbalar / gha-build-monitor
View on GitHub
Adapter to give access to GitHub Actions status via the CatLight Protocol
☆10Mar 13, 2023Updated 3 years ago
apache / airflow-on-k8s-operator
View on GitHub
Airflow on Kubernetes Operator
☆86Feb 6, 2023Updated 3 years ago
Zach-Johnson / spotify-discover
View on GitHub
☆10Sep 30, 2018Updated 7 years ago
gkdevops / python-data-engineer
View on GitHub
Learn Python language for beginners in Data Engineering and Data Analytics
☆35Jul 19, 2026Updated last week
awslabs / amazon-emr-cli
View on GitHub
A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs
☆47Jun 24, 2026Updated last month
aws-quickstart / quickstart-databricks-unified-data-analytics-platform
View on GitHub
AWS Quick Start Team
☆20Oct 3, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bezkoder / spring-boot-data-cassandra
View on GitHub
Spring Boot CRUD Rest APIs with Spring Data Cassandra
☆15Apr 30, 2021Updated 5 years ago
xnuinside / airflow_examples
View on GitHub
Airflow Examples: code samples for Medium articles
☆14Jan 10, 2021Updated 5 years ago
NickAkincilar / Snowflake_SelfService_Sandbox_Config
View on GitHub
☆13Feb 16, 2022Updated 4 years ago
mmocny / web-vitals
View on GitHub
Essential metrics for a healthy site.
☆15May 24, 2022Updated 4 years ago
mrgrain / jsii-struct-builder
View on GitHub
Build jsii structs with ease.
☆34Updated this week
aws-samples / cluster-sample-app
View on GitHub
A very basic app written in Javascript and packaged as a Docker image to be used as a demo when testing clustered deployments in ECS/EKS.
☆11Jun 30, 2023Updated 3 years ago
sanjeetkumar13 / AWS-Solutions-Architect-Associate-SAA-C02-Exam-Prep-Course---2020-UPDATED-
View on GitHub
AWS Solutions Architect Associate (SAA-C02) Exam Prep Course - 2020 UPDATED!, published by Packt
☆15Sep 3, 2020Updated 5 years ago