linzebing / MiniSparkLinks
Java implementation of a mini Spark-like framework named MiniSpark that can run on top of a HDFS cluster. MiniSpark supports operators including Map, FlatMap, MapPair, Reduce, ReduceByKey, Collect, Count, Parallelize, Join and Filter.
☆37Updated 8 years ago
Alternatives and similar repositories for MiniSpark
Users that are interested in MiniSpark are comparing it to the libraries listed below
Sorting:
- Dig Spark's source code.☆17Updated last year
- Distributed KV Storage System based on Raft and RocksDB, can be use to store small files, like images.☆58Updated 5 years ago
- 500 行代码实现一个基于 LSM 的数据库☆145Updated 4 years ago
- ☆77Updated 11 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Updated 6 years ago
- KVStore is a simple Key-Value Store based on B+Tree (disk & memory) for Java☆103Updated last month
- Hadoop分布式文件系统hdfs代码分析☆182Updated 10 years ago
- A simple calculator to demonstrate code gen technology☆26Updated 6 years ago
- ☆56Updated 5 years ago
- flink 流处理源码分析☆80Updated 6 years ago
- Flink Forward Asia 2019 PPT以及视频资料☆37Updated 5 years ago
- Labs of MIT 6.830 Database Systems☆57Updated 8 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆252Updated 6 years ago
- ☆131Updated 6 years ago
- ☆30Updated 3 years ago
- Code samples from blogs posts https://www.querifylabs.com/blog☆28Updated 3 years ago
- ☆133Updated 4 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst☆141Updated 7 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆128Updated 9 months ago
- presto 源码分析☆51Updated 7 years ago
- A simple optimizing Brainfuck compiler (used as the demo for my QCon Beijing 2015 talk)☆62Updated 3 years ago
- ☆64Updated 6 years ago
- ☆11Updated 2 years ago
- 1st AliCloud Database Performance Competition in 2018 - Java rank No.1 source code 阿里云2018年第一届PolarDB数据库性能大赛Java排名第一源码☆201Updated 6 years ago
- Apache Flink 源码分析系列,基于 git tag 1.1.2☆232Updated 8 years ago
- A light weight distributed key-value database system with table concept.☆84Updated 4 years ago
- 一个模仿Kafka的简单消息中间件☆15Updated 3 years ago
- Alluxio源码分析、学习☆14Updated 8 years ago
- Trisk on Flink☆16Updated 3 years ago
- C-Store : A column-oriented DBMS prototype (frozen)☆48Updated 5 years ago