linzebing / MiniSpark
Java implementation of a mini Spark-like framework named MiniSpark that can run on top of an HDFS cluster. MiniSpark supports operators including Map, FlatMap, MapPair, Reduce, ReduceByKey, Collect, Count, Parallelize, Join and Filter.
☆35Updated 7 years ago
Alternatives and similar repositories for MiniSpark:
Users who are interested in MiniSpark are comparing it to the libraries listed below:
- Distributed KV storage system based on Raft and RocksDB; can be used to store small files, such as images.☆59Updated 5 years ago
- Dig Spark's source code.☆17Updated last year
- ☆77Updated 10 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆16Updated 6 years ago
- ☆71Updated 2 years ago
- learn calcite sql parsing☆18Updated 2 years ago
- Alluxio source code analysis and study☆13Updated 8 years ago
- A simple calculator to demonstrate code gen technology☆27Updated 5 years ago
- ☆30Updated 2 years ago
- KVStore is a simple Key-Value Store based on B+Tree (disk & memory) for Java☆99Updated 6 months ago
- Presto source code analysis☆51Updated 7 years ago
- An LSM-based database implemented in 500 lines of code☆143Updated 3 years ago
- ☆64Updated 5 years ago
- Explore the project Tungsten☆1Updated 8 years ago
- a tiny database with ARIES recovery algorithm (WAL and Fuzzy Checkpoint) to achieve ACID☆37Updated 4 years ago
- Flink stream processing source code analysis☆75Updated 5 years ago
- A New Way of Data Lake☆48Updated 3 years ago
- A simple message middleware modeled on Kafka☆15Updated 2 years ago
- TPC-DS Performance tests tool for Flink☆29Updated 3 years ago
- Shared files, presentations, and other materials☆34Updated 3 weeks ago
- Analysis of the code of YARN's RM and NM modules☆49Updated 6 years ago
- A ToyDB (for beginners) based on MIT 6.830 and CMU 15445☆30Updated 3 years ago
- Third Alibaba Middleware Performance Challenge, post-competition write-up☆17Updated 6 years ago
- A toy SQL engine built on top of LSM(LevelDB)☆22Updated 5 years ago
- An efficient database query optimizer for large complex join queries☆129Updated last year
- ☆56Updated 4 years ago
- ☆131Updated 6 years ago
- Labs of MIT 6.830 Database Systems☆57Updated 7 years ago
- Code analysis of HDFS, the Hadoop Distributed File System☆182Updated 9 years ago
- A stripped-out module for viewing the syntax trees generated by Spark SQL☆92Updated 5 years ago