yamrcraft/etl-light

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yamrcraft/etl-light)

yamrcraft / etl-light

A light Kafka to HDFS/S3 ETL library based on Apache Spark

☆40

Alternatives and similar repositories for etl-light

Users that are interested in etl-light are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

konrads / spark-etl
View on GitHub
Set of ETL utils for Spark
☆15May 4, 2020Updated 6 years ago
vngrs / spark-etl
View on GitHub
Apache Spark based ETL Engine
☆71Oct 18, 2016Updated 9 years ago
ycloudnet / ya100
View on GitHub
一个比Spark-Parquet还快5~100倍的存储格式
☆12Feb 22, 2016Updated 10 years ago
xavient / Data-Ingestion-Platform
View on GitHub
☆51Updated this week
dk-stationery / stationery-ink
View on GitHub
Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark
☆12Mar 14, 2016Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shengjk / flinksql-platform
View on GitHub
flinksql-platform
☆19Mar 22, 2021Updated 5 years ago
wuworker / netty-proxy
View on GitHub
基于netty实现代理服务器
☆12Jul 4, 2026Updated 3 weeks ago
wykingfly / ElasticSearch-SQL
View on GitHub
该项目主要是为了熟悉sql的人员能够很方便的进行elasticsearch数据的查询，降低学习成本。
☆47Jan 13, 2015Updated 11 years ago
icgc-dcc / dcc-release
View on GitHub
Second generation of the ICGC DCC release ETL built on Spark
☆10Apr 8, 2019Updated 7 years ago
thejunglejane / pynigma
View on GitHub
A Python client for the Enigma API.
☆14Dec 7, 2022Updated 3 years ago
bluebreezecf / SparkJobServerClient
View on GitHub
Java Client of the Spark Job Server implementing the arranged Rest APIs
☆52Jun 4, 2021Updated 5 years ago
johanprinsloo / akka-micro-dag
View on GitHub
Quick Akka Micro Dag Prototype
☆13Apr 8, 2016Updated 10 years ago
RedisLabs / ReSearch
View on GitHub
Redis search and indexing in Java
☆16Sep 26, 2016Updated 9 years ago
cpbaranwal / Avro-SparkStreaming-Kafka
View on GitHub
Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
CodeRayZhang / Spark-Example
View on GitHub
Spark1.6和spark2.2的示例，包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe
☆15Jan 28, 2018Updated 8 years ago
dianping / storm-util
View on GitHub
☆18Apr 23, 2015Updated 11 years ago
hammerlab / spark-tests
View on GitHub
Utilities for writing tests that use Apache Spark.
☆24Dec 29, 2018Updated 7 years ago
bylee5 / calcite-elasticsearch
View on GitHub
☆14Oct 5, 2022Updated 3 years ago
thunderain-project / thunderain
View on GitHub
A Real-Time Analytical Processing (RTAP) example using Spark/Shark
☆51Feb 21, 2014Updated 12 years ago
wypb / Spark-Flink-Meetup-6-Hangzhou
View on GitHub
杭州第六次 Spark & Flink Meetup
☆30May 14, 2018Updated 8 years ago
vspiewak / twitter-sentiment-analysis
View on GitHub
Streaming tweets with spark, language detection & sentiment analysis, dashboard with Kibana
☆105Dec 21, 2015Updated 10 years ago
BenFradet / spark-kafka-writer
View on GitHub
Write your Spark data to Kafka seamlessly
☆172Jul 10, 2024Updated 2 years ago
cpbaranwal / Spark-Streaming-DirectKafka-Examples
View on GitHub
DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management
☆59Sep 9, 2016Updated 9 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
xiam / resp
View on GitHub
RESP (REdis Serialization Protocol) encoder and decoder.
☆19Dec 6, 2015Updated 10 years ago
anjuke / romar
View on GitHub
Recommendation Web Service
☆17Apr 17, 2013Updated 13 years ago
pdaodao / fiflow
View on GitHub
flink-sql 在 flink 上运行 sql 和构建数据流的平台基于 apache flink 1.10.0
☆113Jun 21, 2022Updated 4 years ago
Stratio / sparta
View on GitHub
Real Time Analytics and Data Pipelines based on Spark Streaming
☆530Oct 24, 2019Updated 6 years ago
realxujiang / storm-kafka-examples
View on GitHub
storm kafka hdfs examples
☆21Nov 28, 2016Updated 9 years ago
markgrover / spark-secure-kafka-app
View on GitHub
Sample Spark Streaming application for secure consumption from Kafka
☆33Jun 19, 2017Updated 9 years ago
jkorab / ameliant-tools
View on GitHub
A set of tools to ease working with Zookeeper and Kafka.
☆23Jan 22, 2016Updated 10 years ago
bernhard-42 / Spark-ETL-Atlas
View on GitHub
A small project to show how to add lineage to Atlas when using Spark as ETL tool
☆12Nov 29, 2016Updated 9 years ago
ml-hongkong / chatbot
View on GitHub
Seq2Seq Chatbot with attention mechanism
☆18Apr 27, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
JasonWiki / dw_etl
View on GitHub
dw etl 工具 mysql 增量、全量抽取 to hive. 合并 hive 数据表, 等数据平台清洗工具
☆10Dec 21, 2016Updated 9 years ago
sujee81 / SparkApps
View on GitHub
Apache Spark applications
☆70Dec 17, 2017Updated 8 years ago
mispecto / realtime-dashboard-example
View on GitHub
This is a real-time dashboard example using Spark Streaming and Node.js
☆25Dec 17, 2025Updated 7 months ago
fbascheper / kafka-connect-telegram
View on GitHub
Kafka-connect telegram connector
☆16Nov 21, 2025Updated 8 months ago
OopsOutOfMemory / spark-sql-hbase
View on GitHub
A Spark SQL HBase connector
☆29May 4, 2015Updated 11 years ago
jeoffreylim / maelstrom
View on GitHub
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …
☆21Feb 6, 2017Updated 9 years ago
JerryLead / SparkProfiler
View on GitHub
Profiling Spark Applications for Performance Comparison and Diagnosis
☆16Nov 11, 2018Updated 7 years ago