sskaje / hive_merge
Merge Small files for Hive Table on HDFS
☆15Updated 11 years ago
Alternatives and similar repositories for hive_merge
Users that are interested in hive_merge are comparing it to the libraries listed below
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- A Spark SQL HBase connector☆29Updated 10 years ago
- Spark Summit 2017 San Francisco☆97Updated 8 years ago
- My Blog☆76Updated 7 years ago
- ☆66Updated 2 years ago
- A custom Spark SQL hint optimizer that resolves JOIN data skew caused by hot-spot data☆48Updated 6 years ago
- A Spark Reliability Testing Suite☆13Updated 8 years ago
- Apache CarbonData Learning☆53Updated 5 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 3 years ago
- Plugin for Presto to allow addition of user functions easily☆120Updated 4 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 3 months ago
- Spark Streaming, Kafka and HBase code accompanying the blog 'Offset Management For Apache Kafka With Apache Spark Streaming'☆23Updated 8 years ago
- A Spark scaffolding project that standardizes the Spark development, deployment, and testing workflow.☆94Updated last year
- spark to yandex clickhouse connector☆69Updated 6 years ago
- fast spark local mode☆35Updated 7 years ago
- A playground for Spark jobs.☆43Updated 6 years ago
- A Presto HBase connector, implemented against the Presto Connector interface specification, that adds HBase querying to Presto. Compared with other open-source HBase connectors, ours is more than 10 to 100 times faster.☆243Updated 2 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | This project has been migrated to Apa…☆178Updated 3 years ago
- ☆131Updated 6 years ago
- Already merged into apache/incubator-kyuubi. ACL Management for Apache Spark SQL with Apache Ranger.☆57Updated 3 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Updated 8 years ago
- spark-summit-north-america-2018-06, More detail please visit☆111Updated 7 years ago
- Java library to integrate Flink and Kudu☆55Updated 8 years ago
- Spark example code☆78Updated 7 years ago
- Storing Kafka offsets in MySQL from Spark Streaming to guarantee zero data loss☆45Updated 8 years ago
- Deep Dive into Apache Spark: an in-depth study of the Spark source code☆260Updated 8 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Updated 8 years ago
- loading hdfs data to clickhouse☆72Updated 3 years ago
- Spark source code analysis☆87Updated 7 years ago