LucaCanali / MiscellaneousLinks
Includes notes on using Apache Spark, with drill down on Spark for Physics, how to run TPCDS on PySpark, how to create histograms with Spark. Also tools for stress testing, measuring CPUs' performance, and I/O latency heat maps. Jupyter notebooks examples for using various DB systems.
☆456Updated last month
Alternatives and similar repositories for Miscellaneous
Users that are interested in Miscellaneous are comparing it to the libraries listed below
Sorting:
- Use the TPC-DS benchmark to test Spark SQL performance☆181Updated 5 years ago
- Benchmark Suite for Apache Spark☆241Updated 2 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆795Updated last week
- The Internals of Spark SQL☆477Updated last week
- Spark Terasort☆121Updated 2 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange