A generic ETL framework with Spark_SQL for transforming data by constructing pipelines with Yaml/Json/Xml
☆21Feb 3, 2026Updated 4 months ago
Alternatives and similar repositories for spark-etl-framework
Users that are interested in spark-etl-framework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jan 18, 2021Updated 5 years ago
- Jobcenter, a client-server application and framework for job management and distributed job execution☆11Aug 19, 2019Updated 6 years ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆48Jun 7, 2026Updated last week
- Vert.x React Demo☆14May 18, 2014Updated 12 years ago
- Set of extensions for kafka connect hdfs☆11May 12, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Open-source event streaming platform built on S3. Kafka-compatible APIs, built-in SQL engine, schema registry — one Rust binary replace…☆65May 21, 2026Updated 3 weeks ago
- [译] 面向机器学习的特征工程☆11Aug 9, 2018Updated 7 years ago
- protobuf pyspark conversion☆23Jun 7, 2023Updated 3 years ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆19Feb 10, 2025Updated last year
- bash scripts install docker☆10Jun 2, 2026Updated 2 weeks ago
- Self-hostable headless QR code generator☆21Feb 5, 2026Updated 4 months ago
- Carpooling to the database (demo)☆13Oct 29, 2018Updated 7 years ago
- Manage one or more PubSub instances using the Elixir registry☆20May 1, 2023Updated 3 years ago
- Proof-of-concept distributed key-value store implementation on top of MicroRaft, Protocol Buffers, and gRPC☆14Feb 15, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Tool for turning Elixir apps into Docker images without a pain.☆18Jan 15, 2019Updated 7 years ago
- 基于spring boot 3.x的starter组件,集成了钉钉机器人发送消息通知,支持多机器人☆12Feb 13, 2023Updated 3 years ago
- Scripts and utilities to install and manage KVM machines☆11Jan 25, 2023Updated 3 years ago
- Flink jobs collection☆17Oct 13, 2020Updated 5 years ago
- A Lasp PG adapter for the Phoenix framework pubsub☆18Apr 18, 2018Updated 8 years ago
- ☆14May 23, 2017Updated 9 years ago
- ☆12Jun 11, 2020Updated 6 years ago
- A candid data lake service for the internet computer.☆15Jun 30, 2021Updated 4 years ago
- Caddy Web UI is a user-friendly interface for managing Caddy server configurations. The application allows users to create, edit, and del…☆21Feb 12, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An excel-like spreadsheet component for SQLPage☆16Aug 4, 2025Updated 10 months ago
- Better, container friendly big-data images for Docker☆38Nov 12, 2016Updated 9 years ago
- A demo presentation for the reveal-hugo Reveal.js Hugo theme☆12Feb 26, 2020Updated 6 years ago
- Go 语言编程的一些代码示例,开箱即用。☆13Nov 5, 2023Updated 2 years ago
- Manage your dotfiles, upload and install☆15Nov 13, 2024Updated last year
- SHell TempLating☆12Apr 8, 2025Updated last year
- A reverse proxy manager written in go, to convert exposed ports into token-based auth protected ports☆20Apr 14, 2025Updated last year
- 大型电商网站session大数据分析实战项目☆18Jun 21, 2022Updated 3 years ago
- ☆10May 24, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Elm SPA for live station schedules for Helsinki region commuter trains☆26Nov 21, 2024Updated last year
- Utility project for working with Kafka Connect.☆35Jul 31, 2024Updated last year
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Oct 9, 2024Updated last year
- Building Event Driven Application with AWS Lambda and Amazon Redshift Data API☆17Oct 27, 2020Updated 5 years ago
- An open source terminology management solution☆18Jun 11, 2023Updated 3 years ago
- 一个基于 Flink CDC 的 CDC 框架,mysql,binlog☆12Jun 1, 2026Updated 2 weeks ago
- 一个提供代码仓库之间同步用的Github Action ,比如(Github同步至Gitee)☆10Nov 15, 2024Updated last year