metatron-app / discovery-spark-engine
A REST server for spark jobs from metatron discovery data preparation.
☆11Updated 2 years ago
Alternatives and similar repositories for discovery-spark-engine:
Users that are interested in discovery-spark-engine are comparing it to the libraries listed below
- Apache Phoenix Connectors☆53Updated last week
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆31Updated 6 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆153Updated last year
- Presto connector for Apache Kudu☆48Updated 5 years ago
- Utilities for processing Flink checkpoints/savepoints☆74Updated 5 years ago
- DataFibers Data Service☆31Updated 3 years ago
- Fluent client for interacting with Spark Standalone Mode's Rest API for submitting, killing and monitoring the state of jobs.☆109Updated 6 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Updated 2 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆173Updated 2 years ago
- Spark 脚手架工程,标准化 spark 开发、部署、测试流程。☆93Updated 5 months ago
- spark将hdfs数据高性能灌入kafka,然后spark streaming/structured streaming高速消费,关注性能,欢迎提供性能/代码优化建议☆33Updated 5 years ago
- Project to create configurable ETL via lightbend configuration using Spark Structured Streaming☆8Updated 6 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Updated 8 years ago
- Custom state store providers for Apache Spark☆92Updated 3 weeks ago
- A modern real-time streaming application serving as a reference framework for developing a big data pipeline, complete with a broad range…☆41Updated 5 years ago
- Kafka Connect to Hbase☆43Updated 4 years ago
- A library for querying Druid data sources with Apache Spark☆23Updated 4 years ago
- Capture changes of HBase to Kafka☆30Updated 8 years ago
- ☆174Updated last year
- Framework for Apache Flink unit tests☆207Updated 5 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆62Updated last year
- facebook presto connectors☆49Updated 3 years ago
- 优化flink的多流操作(例如join),优化点不限于数据丢失问题,以及性能问题☆11Updated 5 years ago
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆45Updated 7 years ago
- Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等☆35Updated 5 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- Alerting and monitoring tool for Apache Spark☆23Updated 2 years ago