☆12Mar 6, 2021Updated 5 years ago
Alternatives and similar repositories for airflow-spark-aws-emr
Users that are interested in airflow-spark-aws-emr are comparing it to the libraries listed below.
- Translate your CSV files effortlessly across multiple languages and save your time and effort.☆19Jul 2, 2024Updated last year
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Dec 7, 2022Updated 3 years ago
- Branch Metrics Win32/C++ SDK☆10Jun 10, 2025Updated 9 months ago
- ☆12Sep 30, 2021Updated 4 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆13Jun 26, 2022Updated 3 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- XCloud Project's objective is to build similar infrastructure on both AWS and GCP to have multi cloud infrastructure for an organization …☆14Aug 1, 2022Updated 3 years ago
- ☆10Oct 12, 2021Updated 4 years ago
- Azure DP 900 notes☆10Jan 23, 2022Updated 4 years ago
- Projects I implemented to finish Udacity Nanodegree Programs from Data Engineering to Machine Learning Engineering.☆23Jun 5, 2023Updated 2 years ago
- A fully integrated platform for aggregating, visualising and analysing alternative data☆13Mar 15, 2024Updated 2 years ago
- Apache Flink/Apache Kafka streaming data analytics demonstration using Streaming Synthetic Sales Data Generator☆15Jun 4, 2024Updated last year
- Kubernetes LDAP authentication service written in Go.☆10May 4, 2019Updated 6 years ago
- This is an analytical project done using python to process and extract valuable insights from WhatsApp text file, deployed as a webapp us…☆19Dec 8, 2023Updated 2 years ago
- This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)☆11Apr 29, 2022Updated 3 years ago
- ☆15Nov 11, 2023Updated 2 years ago
- This is a capstone project associated with MLOps Zoomcamp. The end goal of the project is to build an end-to-end machine learning projec…☆13Sep 8, 2022Updated 3 years ago
- Open Source Data Contracts In JSON to UNIFY understanding and efforts efficiently☆16Dec 16, 2022Updated 3 years ago
- A repository that includes examples from Spanish posts☆10Dec 19, 2025Updated 3 months ago
- Semantic segmentation for visual terrain classification and autonomous navigation in TensorFlow☆13Mar 22, 2017Updated 9 years ago
- A bookmarklet that turns a DCInside gallery into a real-time chat window.☆16Nov 2, 2025Updated 4 months ago
- ETL flow framework based on Yaml configs in Python☆22Oct 21, 2023Updated 2 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- ☆16Apr 8, 2024Updated last year
- A tool for handwritten text (straight and skewed) line segmentation based on a statistical approach.☆40Jun 29, 2018Updated 7 years ago
- ☆13Mar 30, 2024Updated last year
- ☆10Oct 7, 2024Updated last year
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- A new approach for the real time 3D semantic segmentation based on feature abstract and deep learning method☆16Nov 28, 2017Updated 8 years ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆17Aug 14, 2023Updated 2 years ago
- Customized Jupyter Spark Docker images with everything you need☆16May 3, 2025Updated 10 months ago
- 🛰🐦 Bird tracking - GPS tracking network for large birds☆23Mar 17, 2026Updated last week
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆15Jan 4, 2026Updated 2 months ago
- End to end mlflow with feast example☆17May 18, 2021Updated 4 years ago
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆21Jul 26, 2024Updated last year
- Kubernetes LDAP authentication with the Webhook Token authentication plugin☆12Apr 14, 2020Updated 5 years ago
- ☆23Apr 13, 2019Updated 6 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆26Jul 19, 2017Updated 8 years ago
- Labs and demos for courses in the Data Engineer track of GCP Training (http://cloud.google.com/training).☆16Oct 28, 2019Updated 6 years ago