miztiik / emr-on-eks
Run EMR workloads on EKS
☆13Updated 3 years ago
Alternatives and similar repositories for emr-on-eks:
Users that are interested in emr-on-eks are comparing it to the libraries listed below
- Skeleton project for Apache Airflow training participants to work on.☆16Updated 4 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆82Updated 9 months ago
- Read Delta tables without any Spark☆47Updated 11 months ago
- ☆16Updated 4 years ago
- Dask integration for Snowflake☆30Updated 3 months ago
- MLOps NYC 2019 training session: Runnign Spark on Kubernetes☆18Updated 3 years ago
- Example script to deploy DAGs to Google Cloud Composer.☆15Updated 2 years ago
- ☆30Updated 3 years ago
- Data validation library for PySpark 3.0.0☆34Updated 2 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Deploy your Spark Production Cluster on Kubernetes☆47Updated 4 years ago
- ☆10Updated 6 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- A pyspark lib to validate data quality☆18Updated 2 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆30Updated this week
- Airflow helm chart for AWS EKS☆18Updated 4 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Updated 6 months ago
- AWS Lambda function to get events in Kafka topic when files are uploaded to S3☆24Updated 6 years ago
- A collection of airflow sample workflows for data processing on aws☆12Updated 7 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated last year
- ☆22Updated 2 years ago
- Machine Learning Projects with Flytekit☆35Updated last year
- Asynchronous actions for PySpark☆47Updated 3 years ago
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- Materials of the Official Helm Chart Webinar☆27Updated 3 years ago
- Airflow code accompanying blog post.☆21Updated 6 years ago