Spark all the ETL Pipelines
☆37Aug 2, 2023Updated 2 years ago
Alternatives and similar repositories for SparkETL
Users that are interested in SparkETL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jun 3, 2023Updated 3 years ago
- Lecture Notes for DSML Jun22 Beginner's Intermediate module☆11Oct 14, 2022Updated 3 years ago
- ☆15Jan 6, 2025Updated last year
- My personal page, CV and blog☆15May 8, 2026Updated last month
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆13May 25, 2023Updated 3 years ago
- Applying the Trading Deep Q-Network algorithm (TDQN) on shares in the hydrogen sector.☆11Nov 11, 2020Updated 5 years ago
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow☆45Oct 27, 2025Updated 7 months ago
- ☆23Jan 22, 2018Updated 8 years ago
- Evaluation Matrix for Change Data Capture☆25Aug 6, 2024Updated last year
- google cloud machine learning engineer☆14May 23, 2021Updated 5 years ago
- ☆16Mar 9, 2026Updated 3 months ago
- Open-Source Tools for Real World Problem Series☆16Nov 21, 2022Updated 3 years ago
- Distributed System in Docker with Apache Kafka and Spark for big data streaming and visualisation (NodeJS, TypeScript, React, NestJS, Jav…☆24Apr 28, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- My *nix dotfiles☆12Apr 16, 2026Updated last month
- DuckDB Copilot Extension☆10Jan 12, 2026Updated 5 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16May 22, 2026Updated 3 weeks ago
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.☆12Oct 25, 2024Updated last year
- Calico API☆24Jun 6, 2026Updated last week
- Get map value via dot-delimited path or nil.☆30Sep 9, 2014Updated 11 years ago
- Zabbix Template (>2.4) and resources useful to monitor zfs on linux (zpool)☆13Jan 26, 2017Updated 9 years ago
- ☆12Aug 26, 2024Updated last year
- CLI secret management☆16May 15, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Apache Polaris Tools, additional tooling for Apache Polaris☆29Jun 7, 2026Updated last week
- Source code of the institutional insights TradingView indicator.☆11Jan 30, 2025Updated last year
- Contains spark dataframe solutions of leetcode questions☆24Dec 13, 2022Updated 3 years ago
- Open source package for Survival Analysis modeling☆23Feb 3, 2020Updated 6 years ago
- OpenKruise Helm Charts.☆16Updated this week
- A foreign data wrapper for PostgreSQL allowing easy accessing of Apache ORC formatted data files.☆11Sep 21, 2020Updated 5 years ago
- Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP☆10Jul 18, 2022Updated 3 years ago
- Bigdata on Kubernetes, Published by Packt☆37Oct 1, 2024Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆90Jun 25, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Exploring retrieval systems for language models☆14Apr 12, 2025Updated last year
- Code/Notes for the Data Engineering Zoomcamp by DataTalksClub☆32Mar 16, 2023Updated 3 years ago
- Collection of useful Helm Charts. Well test with KinD and Kubeconform☆19Updated this week
- Example project for building scalable data pipelines with Kedro and Ibis.☆14Dec 10, 2025Updated 6 months ago
- ☆13Apr 29, 2026Updated last month
- Building Data Science Solutions with Anaconda, published by Packt☆18Mar 2, 2026Updated 3 months ago
- Literate Computing for Reproducible Infrastructure - Hadoop Practice☆11Mar 5, 2026Updated 3 months ago