netease-bigdata / ne-spark-courseware
NetEase Spark Courses
☆15 Updated 6 years ago
Alternatives and similar repositories for ne-spark-courseware
Users who are interested in ne-spark-courseware are comparing it to the libraries listed below
- Apache CarbonData source code reading ☆61 Updated 5 years ago
- fast spark local mode ☆35 Updated 6 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst ☆141 Updated 6 years ago
- A custom SparkSQL Hint optimizer that addresses JOIN data skew caused by hotspot data ☆48 Updated 6 years ago
- Aloha: a distributed task scheduling and management framework ☆64 Updated 2 years ago
- Apache CarbonData Learning ☆53 Updated 5 years ago
- TPC-DS Performance tests tool for Flink ☆29 Updated 4 years ago
- A RPC framework leveraging Spark RPC module ☆210 Updated 6 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech). ☆152 Updated 2 years ago
- ☆75 Updated 11 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite ☆68 Updated 2 years ago
- Explore the project Tungsten ☆1 Updated 8 years ago
- Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling ☆23 Updated 7 years ago
- ☆39 Updated 6 years ago
- ☆17 Updated last year
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark. ☆133 Updated 2 years ago
- Scalable NameNode RPC Proxy for HDFS Federation ☆86 Updated 9 years ago
- A library based on delta for Spark and MLSQL ☆61 Updated 4 years ago
- Alerting and monitoring tool for Apache Spark ☆23 Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark ☆35 Updated last year
- A third party tool to simulate the calculation result of Flink's memory configuration. Valid for Flink-1.10 and Flink-1.11. ☆45 Updated 4 years ago
- ☆236 Updated 2 years ago
- ☆33 Updated 6 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation ☆35 Updated 8 years ago
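- akkaflow is a distributed, highly available DAG workflow scheduling tool built on the Akka architecture. It can assign child nodes to cluster machines for parallel execution, making efficient use of cluster resources. ☆107 Updated 5 years ago
- Spark source code analysis ☆87 Updated 7 years ago
- A tool to get better debug info on Spark's memory usage ☆42 Updated 5 years ago
- Presto source code analysis ☆51 Updated 7 years ago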
- ☆56 Updated 4 years ago
- Mirror of Apache Hive ☆32 Updated 5 years ago