☆179Sep 3, 2017Updated 8 years ago
Alternatives and similar repositories for spark-code-analysis
Users that are interested in spark-code-analysis are comparing it to the libraries listed below
- Digging holes and filling them in☆687Aug 18, 2016Updated 9 years ago
- ☆131Jan 10, 2019Updated 7 years ago
- A RPC framework leveraging Spark RPC module☆209Mar 13, 2019Updated 6 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst☆142Jul 5, 2018Updated 7 years ago
- NetEase Spark Courses☆15Sep 4, 2018Updated 7 years ago
- Coolplay Spark: Spark source code analysis, Spark libraries, and more☆3,482May 18, 2022Updated 3 years ago
- Spark source code analysis☆86Nov 23, 2017Updated 8 years ago
- Notes talking about the design and implementation of Apache Spark☆5,360Apr 2, 2024Updated last year
- A simple optimizing Brainfuck compiler (used as the demo for my QCon Beijing 2015 talk)☆62Sep 23, 2022Updated 3 years ago
- Apache Flink source code analysis series, based on git tag 1.1.2☆233Feb 10, 2017Updated 9 years ago
- Stream computing platform for bigdata☆408Apr 24, 2024Updated last year
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,516Updated this week
- Shared study notes on Spark 2.4.0☆199Jan 18, 2019Updated 7 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,845May 29, 2024Updated last year
- Distributed SQL query engine for big data☆55Jun 17, 2014Updated 11 years ago
- Demonstrating various Scala features through examples!☆22Aug 30, 2016Updated 9 years ago
- My sample code repository☆145Dec 16, 2022Updated 3 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆152Apr 21, 2023Updated 2 years ago
- JVM related exercises☆11Jul 16, 2017Updated 8 years ago
- ServiceFramework example project☆10Apr 2, 2016Updated 9 years ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆262May 12, 2024Updated last year
- TPC-DS queries☆65Jun 17, 2015Updated 10 years ago
- A tool to get better debug info on spark's memory usage☆42Aug 21, 2019Updated 6 years ago
- ☆10Aug 28, 2014Updated 11 years ago
- FCTT code repository☆10May 22, 2018Updated 7 years ago
- Translation of the Calcite documentation☆25Feb 21, 2018Updated 8 years ago
- ☆235Sep 15, 2022Updated 3 years ago
- My Blog☆76May 3, 2018Updated 7 years ago
- The ISC Anomaly Detection and Classification Framework implemented for Apache Flink.☆13Dec 14, 2016Updated 9 years ago
- Source code analysis of Flink stream processing☆80Sep 8, 2019Updated 6 years ago
- Compass is a task diagnosis platform for bigdata☆405Nov 23, 2024Updated last year
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- High performance data store solution☆1,446Feb 21, 2026Updated last week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆334Sep 29, 2023Updated 2 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆257May 13, 2019Updated 6 years ago
- Learning notes of Apache Spark source code☆75Nov 19, 2015Updated 10 years ago
- An ad hoc query service based on the Spark SQL engine.☆381Dec 16, 2023Updated 2 years ago
- Blog☆16Sep 17, 2025Updated 5 months ago
- Weibo data analysis service framework.☆12Nov 10, 2015Updated 10 years ago