linzebing / MiniSpark
Java implementation of a mini Spark-like framework named MiniSpark that can run on top of an HDFS cluster. MiniSpark supports operators including Map, FlatMap, MapPair, Reduce, ReduceByKey, Collect, Count, Parallelize, Join and Filter.
☆36 Updated 7 years ago
Alternatives and similar repositories for MiniSpark
Users that are interested in MiniSpark are comparing it to the libraries listed below
- Dig Spark's source code. ☆17 Updated last year
- ☆77 Updated 11 years ago
- ☆30 Updated 2 years ago
- A simple calculator to demonstrate code gen technology ☆27 Updated 6 years ago
- Explore the project Tungsten ☆1 Updated 8 years ago
- KVStore is a simple Key-Value Store based on B+Tree (disk & memory) for Java ☆100 Updated 9 months ago
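- Flink stream-processing source code analysis ☆78 Updated 5 years ago
- Flink Forward Asia 2019 slides and video materials ☆38 Updated 5 years ago
- Kudu source code analysis notes ☆24 Updated 5 years ago
- An LSM-based database implemented in 500 lines of code ☆143 Updated 4 years ago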
- ☆11 Updated 2 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst ☆141 Updated 7 years ago
- ☆130 Updated 6 years ago
- A RPC framework leveraging Spark RPC module ☆210 Updated 6 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis ☆17 Updated 6 years ago
- Code analysis of HDFS, the Hadoop Distributed File System ☆181 Updated 9 years ago
- TPC-DS Performance tests tool for Flink ☆29 Updated 4 years ago
- learn calcite sql parsing ☆18 Updated 2 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite ☆69 Updated 3 years ago
- Labs of MIT 6.830 Database Systems ☆57 Updated 7 years ago
- Aims to provide the most practical learning guide for Spark code development ☆11 Updated 3 years ago
- Learning notes of Apache Spark source code ☆76 Updated 9 years ago
- Code analysis of YARN's RM and NM modules ☆49 Updated 6 years ago
- An extracted standalone module for viewing the syntax tree generated by Spark SQL ☆92 Updated 6 years ago
- Distributed KV Storage System based on Raft and RocksDB, can be used to store small files, like images. ☆58 Updated 5 years ago
- Shared files, presentations, and other materials ☆35 Updated 2 weeks ago
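- Spark source code analysis ☆87 Updated 7 years ago
- Shared notes from reading the Flink source code, continually recording the reading process ☆93 Updated 8 months ago
- Spark source code analysis, mainly covering the SparkContext source, Executor process startup, Stage division, Task execution, and new features in Spark 2.0 ☆82 Updated 6 years ago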
- Code samples from blog posts https://www.querifylabs.com/blog ☆28 Updated 3 years ago