Set of ETL utils for Spark
☆15May 4, 2020Updated 5 years ago
Alternatives and similar repositories for spark-etl
Users that are interested in spark-etl are comparing it to the libraries listed below
Sorting:
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- contains non confidential smart contracts from eterland☆16Jun 8, 2023Updated 2 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆33Apr 12, 2022Updated 3 years ago
- Apache Spark based ETL Engine☆71Oct 18, 2016Updated 9 years ago
- Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark☆10Aug 17, 2018Updated 7 years ago
- 支持分库分表jdbc的flink connector☆10Dec 31, 2021Updated 4 years ago
- ☆10Aug 13, 2021Updated 4 years ago
- Second generation of the ICGC DCC release ETL built on Spark☆10Apr 8, 2019Updated 6 years ago
- An exploration of Flink and change-data-capture via flink-cdc-connectors☆11Jul 7, 2021Updated 4 years ago
- A timer module for Redis☆11Oct 16, 2019Updated 6 years ago
- SQL AI Agent - Talk to your DB in Natural Language☆15Oct 20, 2025Updated 4 months ago
- hudi 中文文档☆37Jan 9, 2020Updated 6 years ago
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆10Jul 7, 2022Updated 3 years ago
- A yeoman-based template to generate a great documentation website☆11Feb 3, 2023Updated 3 years ago
- Scala HTTP/SOCKS proxy library, based on akka-streams☆10Nov 3, 2018Updated 7 years ago
- A simple golang job queue☆13Jan 19, 2023Updated 3 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- wkhtmltopdf compiled on Alpine Linux with Java baseimage☆14Jan 15, 2021Updated 5 years ago
- ☆10May 25, 2017Updated 8 years ago
- ☆15Dec 2, 2020Updated 5 years ago
- flink connector for redis☆10Apr 22, 2023Updated 2 years ago
- Exposes Redis stream through the command line☆12Jun 28, 2022Updated 3 years ago
- 观点型问题阅读理解 challenger.ai☆10Nov 14, 2018Updated 7 years ago
- A simple example usage of HBase on Trusted Analytics Platform.☆10Jul 6, 2016Updated 9 years ago
- Converts CDX and CDXML from and to CML☆12Feb 17, 2024Updated 2 years ago
- An easy-to-use, scalable spark streaming ETL tool and sdk☆13Aug 14, 2017Updated 8 years ago
- Generate auto-signed TLS certificates for your docker swarm cluster☆17Nov 10, 2015Updated 10 years ago
- Hortonworks Data Platform Data Generation Tool☆13Nov 30, 2017Updated 8 years ago
- SQL for Redis☆11Sep 16, 2022Updated 3 years ago
- 基于netty实现代理服务器☆11Nov 17, 2019Updated 6 years ago
- A server made only by composing functions☆10Jul 1, 2017Updated 8 years ago
- Event ticketing system with Next.js and Appwrite☆10Jun 22, 2023Updated 2 years ago
- Send an issue to propose a talk☆17Jul 12, 2019Updated 6 years ago
- flink sql☆11Jun 21, 2022Updated 3 years ago
- Building custom data sources for Apache Spark, in Java.☆12Oct 12, 2020Updated 5 years ago
- Fast, reliable, and scalable channels implementation based on Redis streams.☆11Jun 25, 2024Updated last year
- Zero dependency Model generator based on the method of least squares. Written in plain JS.☆11Apr 1, 2024Updated last year
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago