linzebing / MiniSparkLinks
Java implementation of a mini Spark-like framework named MiniSpark that can run on top of a HDFS cluster. MiniSpark supports operators including Map, FlatMap, MapPair, Reduce, ReduceByKey, Collect, Count, Parallelize, Join and Filter.
☆37Updated 8 years ago
Alternatives and similar repositories for MiniSpark
Users that are interested in MiniSpark are comparing it to the libraries listed below
Sorting:
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Updated 7 years ago
- Hadoop分布式文件系统hdfs代码分析☆182Updated 10 years ago
- Distributed KV Storage System based on Raft and RocksDB, can be use to store small files, like images.☆58Updated 5 years ago
- Dig Spark's source code.☆17Updated last year
- flink 流处理源码分析☆80Updated 6 years ago
- KVStore is a simple Key-Value Store based on B+Tree (disk & memory) for Java☆103Updated 2 months ago
- 500 行代码实现一个基于 LSM 的数据库☆145Updated 4 years ago
- presto 源码分析☆51Updated 7 years ago
- Simple Database Management System☆39Updated 9 years ago
- A simple calculator to demonstrate code gen technology☆26Updated 6 years ago
- ☆30Updated 3 years ago
- ☆77Updated 11 years ago
- Flink Forward Asia 2019 PPT以及视频资料☆37Updated 5 years ago
- Labs of MIT 6.830 Database Systems☆57Updated 8 years ago
- Kudu源码分析笔记☆25Updated 6 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆253Updated 6 years ago
- 1st AliCloud Database Performance Competition in 2018 - Java rank No.1 source code 阿里云2018年第一届PolarDB数据库性能大赛Java排名第一源码☆200Updated 6 years ago
- ☆74Updated 3 years ago
- A simple optimizing Brainfuck compiler (used as the demo for my QCon Beijing 2015 talk)☆62Updated 3 years ago
- ☆56Updated 5 years ago
- ☆131Updated 6 years ago
- Code samples from blogs posts https://www.querifylabs.com/blog☆28Updated 3 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst☆141Updated 7 years ago
- Apache Flink 源码分析系列,基于 git tag 1.1.2☆233Updated 8 years ago
- 对yarn的的RM,NM模块代码进行分析☆49Updated 7 years ago
- 剥离的模块,用于查看Spark SQL生成的语法树☆92Updated 6 years ago
- A RPC framework leveraging Spark RPC module☆209Updated 6 years ago
- learn calcite sql parsing☆18Updated 3 years ago
- Implementation for MIT 6.824 Distributed System☆14Updated 11 years ago
- C-Store : A column-oriented DBMS prototype (frozen)☆48Updated 5 years ago