linzebing / MiniSparkLinks
Java implementation of a mini Spark-like framework named MiniSpark that can run on top of a HDFS cluster. MiniSpark supports operators including Map, FlatMap, MapPair, Reduce, ReduceByKey, Collect, Count, Parallelize, Join and Filter.
☆37Updated 8 years ago
Alternatives and similar repositories for MiniSpark
Users that are interested in MiniSpark are comparing it to the libraries listed below
Sorting:
- KVStore is a simple Key-Value Store based on B+Tree (disk & memory) for Java☆103Updated last month
- 500 行代码实现一个基于 LSM 的数据库☆145Updated 4 years ago
- Dig Spark's source code.☆17Updated last year
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Updated 6 years ago
- flink 流处理源码分析☆80Updated 6 years ago
- A simple calculator to demonstrate code gen technology☆26Updated 6 years ago
- Hadoop分布式文件系统hdfs代码分析☆182Updated 10 years ago
- ☆77Updated 11 years ago
- Distributed KV Storage System based on Raft and RocksDB, can be use to store small files, like images.☆58Updated 5 years ago
- Flink Forward Asia 2019 PPT以及视频资料☆37Updated 5 years ago
- ☆30Updated 3 years ago
- ☆56Updated 5 years ago
- presto 源码分析☆51Updated 7 years ago
- ☆131Updated 6 years ago
- Shared files, presentations, and other materials☆35Updated 3 weeks ago
- 对yarn的的RM,NM模块代码进行分析☆49Updated 7 years ago
- C-Store : A column-oriented DBMS prototype (frozen)☆48Updated 5 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆252Updated 6 years ago
- Apache Flink 源码分析系列,基于 git tag 1.1.2☆232Updated 8 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst☆141Updated 7 years ago
- learn calcite sql parsing☆18Updated 3 years ago
- A simple optimizing Brainfuck compiler (used as the demo for my QCon Beijing 2015 talk)☆62Updated 3 years ago
- 面向单机与分布式 OLTP/OLAP 场景的可暂停的渐进式 SQL 引擎 (只用于研究)☆12Updated 2 years ago
- A light weight distributed key-value database system with table concept.☆84Updated 4 years ago
- An efficient, conscise, and simple implementation of a purely on-disk B+ Tree data structure☆175Updated last year
- 剥离的模块,用于查看Spark SQL生成的语法树☆92Updated 6 years ago
- 翻译Calcite文档,非官方☆15Updated 6 years ago
- Labs of MIT 6.830 Database Systems☆57Updated 8 years ago
- 经典论文阅读笔记,文章同步发布在知乎和博客上。欢迎提 PR☆229Updated 4 years ago
- Code samples from blogs posts https://www.querifylabs.com/blog☆28Updated 3 years ago