permanentstar / spark-sql-dsv2-extension
A SQL extension built on the Spark 3 DataSource V2 API, e.g. Hive V2 catalog support across multiple clusters
☆12Updated 3 years ago
Alternatives and similar repositories for spark-sql-dsv2-extension
Users that are interested in spark-sql-dsv2-extension are comparing it to the libraries listed below
- A Spark Atlas connector to track data lineage in Apache Atlas☆265Updated 3 years ago
- Apache HBase Connectors☆248Updated 3 weeks ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆153Updated 2 years ago
- ☆236Updated 3 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆550Updated 4 years ago
- A custom Spark SQL hint optimizer that resolves JOIN data skew caused by hot-spot data☆48Updated 7 years ago
- An extracted standalone module for viewing the syntax tree generated by Spark SQL☆91Updated 6 years ago
- An RPC framework leveraging the Spark RPC module☆208Updated 6 years ago
- ☆550Updated 4 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa…☆182Updated 3 years ago
- SparkOnHBase☆278Updated 4 years ago
- NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.☆120Updated 2 months ago
- Apache HBase Operator Tools☆183Updated 2 months ago
- Native, optimized access to HBase data through Spark SQL/DataFrame interfaces☆315Updated 3 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆285Updated 2 months ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Updated 8 years ago
- A third party tool to simulate the calculation result of Flink's memory configuration. Valid for Flink-1.10 and Flink-1.11.☆45Updated 5 years ago
- ☆17Updated last year
- Apache Flink Website☆156Updated this week
- ☆106Updated 3 weeks ago
- Mirror of Apache Hive☆32Updated 5 years ago
- The Internals of Spark Structured Streaming☆422Updated 2 weeks ago
- Java library to integrate Flink and Kudu☆55Updated 8 years ago
- Spark RDD to read, write and delete from HBase☆274Updated 5 years ago
- ☆179Updated 8 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 3 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Updated 2 years ago
- Cloudera Manager Extensibility Tools and Documentation.☆193Updated 2 years ago
- My Blog☆76Updated 7 years ago
- DirectKafka examples for Spark Streaming: 1. with checkpointing; 2. with custom offset management☆60Updated 9 years ago