qwshen/spark-etl-framework

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qwshen/spark-etl-framework)

qwshen / spark-etl-framework

A generic ETL framework with Spark_SQL for transforming data by constructing pipelines with Yaml/Json/Xml

☆21

Alternatives and similar repositories for spark-etl-framework

Users that are interested in spark-etl-framework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mouryar / spring_hive_jdbc_template
View on GitHub
☆10Feb 10, 2017Updated 9 years ago
yeastrc / jobcenter
View on GitHub
Jobcenter, a client-server application and framework for job management and distributed job execution
☆11Aug 19, 2019Updated 6 years ago
MarekLani / Scala-Spark-VSCode-Remote-Containers
View on GitHub
☆12Jan 18, 2021Updated 5 years ago
ddossot / vertx-react-demo
View on GitHub
Vert.x React Demo
☆14May 18, 2014Updated 12 years ago
2gis / kafka-connect-hdfs-ext
View on GitHub
Set of extensions for kafka connect hdfs
☆11May 12, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
bernhard-42 / Spark-ETL-Atlas
View on GitHub
A small project to show how to add lineage to Atlas when using Spark as ETL tool
☆12Nov 29, 2016Updated 9 years ago
Azure-Samples / hdinsight-java-hive-jdbc
View on GitHub
An example of how to use the JDBC to issue Hive queries from a Java client application.
☆11Apr 5, 2018Updated 8 years ago
DataReply / kafka-connect-elastic-search-sink
View on GitHub
☆20Dec 21, 2016Updated 9 years ago
dyrnq / install-docker
View on GitHub
bash scripts install docker
☆10Jun 26, 2026Updated last month
crflynn / pbspark
View on GitHub
protobuf pyspark conversion
☆23Jun 7, 2023Updated 3 years ago
MatthewSteel / carpool
View on GitHub
Carpooling to the database (demo)
☆13Oct 29, 2018Updated 7 years ago
sqlpage / sqlpage-spreadsheet
View on GitHub
An excel-like spreadsheet component for SQLPage
☆16Aug 4, 2025Updated 11 months ago
dockerator / dockerator-elixir
View on GitHub
Tool for turning Elixir apps into Docker images without a pain.
☆18Jan 15, 2019Updated 7 years ago
wushujames / kafka-utilities
View on GitHub
☆26Dec 18, 2019Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
cerndb / SparkPlugins
View on GitHub
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…
☆96May 11, 2026Updated 2 months ago
aceberg / ClickAHabit
View on GitHub
Daily habit tracker and counter
☆11Apr 2, 2024Updated 2 years ago
Menci / Tsukasa
View on GitHub
A flexible port forwarder among TCP, UNIX socket and (optionally) Tailscale, with PROXY protocol support, written in Golang.
☆15Sep 24, 2024Updated last year
intpl / 1op-elixir-vuejs
View on GitHub
☆14May 23, 2017Updated 9 years ago
okkam-it / flink-examples
View on GitHub
Flink jobs collection
☆17Oct 13, 2020Updated 5 years ago
matthewhammer / candid-spaces
View on GitHub
A candid data lake service for the internet computer.
☆15Jun 30, 2021Updated 5 years ago
fabricalab / streaming-flink-dynamodb-connector
View on GitHub
DynamoDB‎ connector for Apache Flink
☆12Jun 14, 2023Updated 3 years ago
aws-samples / aws-keyspaces-lambda-python
View on GitHub
☆10May 24, 2021Updated 5 years ago
mustafaturan / messenger_bot
View on GitHub
Unofficial Facebook Messenger Platform *chatbot client* and *webhook handler*
☆16Aug 2, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
cyngn / vertx-kafka
View on GitHub
☆31Mar 21, 2016Updated 10 years ago
ml0renz0 / shtpl
View on GitHub
SHell TempLating
☆12Apr 8, 2025Updated last year
donovanmuller / hashistack-vagrant
View on GitHub
A Vagrant managed VM containing a minimal Hashistack for development
☆12Aug 2, 2017Updated 8 years ago
aws-samples / aws-lambda-redshift-event-driven-app
View on GitHub
Building Event Driven Application with AWS Lambda and Amazon Redshift Data API
☆17Oct 27, 2020Updated 5 years ago
mesosphere-backup / tf_dcos_core
View on GitHub
A Terraform module to install, upgrade, and modify nodes for DC/OS clusters.
☆13Feb 14, 2019Updated 7 years ago
vic / mill-docker
View on GitHub
Build minimalist distroless docker images for your java applications using Mill
☆15Jul 15, 2025Updated last year
aws-samples / analysing-realtime-streaming-data-using-msk-emr
View on GitHub
☆14Jun 25, 2020Updated 6 years ago
Heaust-ops / rauxy
View on GitHub
A reverse proxy manager written in go, to convert exposed ports into token-based auth protected ports
☆20Apr 14, 2025Updated last year
Wraient / myd
View on GitHub
Manage your dotfiles, upload and install
☆14Nov 13, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
corelight / json-tcp-lb
View on GitHub
line based tcp load balancing proxy.
☆14Jun 18, 2024Updated 2 years ago
joshed-io / programming-quotes
View on GitHub
A demo presentation for the reveal-hugo Reveal.js Hugo theme
☆12Feb 26, 2020Updated 6 years ago
SmilyOrg / tinygpkg
View on GitHub
Go library for local, small, fast reverse geocoding
☆15Dec 2, 2023Updated 2 years ago
gabrielnic / dfinity-react
View on GitHub
☆13Dec 8, 2022Updated 3 years ago
datafibers-community / df_data_service
View on GitHub
DataFibers Data Service
☆31Feb 11, 2022Updated 4 years ago
phpdragon / hub-mirror-action
View on GitHub
一个提供代码仓库之间同步用的Github Action ，比如(Github同步至Gitee)
☆10Nov 15, 2024Updated last year
Maskvvv / easy-flink-cdc
View on GitHub
一个基于 Flink CDC 的 CDC 框架，mysql，binlog
☆12Jun 1, 2026Updated last month