☆179Sep 3, 2017Updated 8 years ago
Alternatives and similar repositories for spark-code-analysis
Users that are interested in spark-code-analysis are comparing it to the libraries listed below
Sorting:
- 挖坑与填坑☆687Aug 18, 2016Updated 9 years ago
- ☆131Jan 10, 2019Updated 7 years ago
- A RPC framework leveraging Spark RPC module☆209Mar 13, 2019Updated 7 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst☆142Jul 5, 2018Updated 7 years ago
- NetEase Spark Courses☆15Sep 4, 2018Updated 7 years ago
- 酷玩 Spark: Spark 源代码解析、Spark 类库等☆3,483May 18, 2022Updated 3 years ago
- Notes talking about the design and implementation of Apache Spark☆5,364Apr 2, 2024Updated last year
- Spark源码剖析☆86Nov 23, 2017Updated 8 years ago
- calcite文档翻译☆25Feb 21, 2018Updated 8 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,530Updated this week
- flink 流处理源码分析☆80Sep 8, 2019Updated 6 years ago
- 通过实例来演示Scala中的各种特性!☆22Aug 30, 2016Updated 9 years ago
- Apache Flink 源码分析系列,基于 git tag 1.1.2☆233Feb 10, 2017Updated 9 years ago
- Stream computing platform for bigdata☆408Apr 24, 2024Updated last year
- spark ml 算法原理剖析以及具体的源码实现分析☆1,960Mar 25, 2019Updated 6 years ago
- A simple optimizing Brainfuck compiler (used as the demo for my QCon Beijing 2015 talk)☆62Sep 23, 2022Updated 3 years ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆262May 12, 2024Updated last year
- JVM related exercises☆11Jul 16, 2017Updated 8 years ago
- My sample code repository☆145Dec 16, 2022Updated 3 years ago
- A tool to get better debug info on spark's memory usage☆42Aug 21, 2019Updated 6 years ago
- Distributed SQL query engine for big data☆55Jun 17, 2014Updated 11 years ago
- My Blog☆76May 3, 2018Updated 7 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,843May 29, 2024Updated last year
- Spark2.4.0 学习笔记分享☆199Jan 18, 2019Updated 7 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Apr 21, 2023Updated 2 years ago
- 博客☆16Sep 17, 2025Updated 6 months ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,039Updated this week
- MySQL-based distributed lock☆16Jun 21, 2022Updated 3 years ago
- TPC-DS queries☆65Jun 17, 2015Updated 10 years ago
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 9 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆380Dec 16, 2023Updated 2 years ago
- Research on distributed system☆73Mar 19, 2021Updated 5 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- ☆235Sep 15, 2022Updated 3 years ago
- The Internals of Apache Spark☆1,542Jul 5, 2025Updated 8 months ago
- Compass is a task diagnosis platform for bigdata☆406Nov 23, 2024Updated last year
- Spark SQL index for Parquet tables☆134May 6, 2021Updated 4 years ago
- High performance data store solution☆1,446Mar 11, 2026Updated last week
- fast spark local mode☆35Aug 20, 2018Updated 7 years ago