kayousterhout/trace-analysis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kayousterhout/trace-analysis)

kayousterhout / trace-analysis

Scripts to analyze Spark's performance

☆136

Alternatives and similar repositories for trace-analysis

Users that are interested in trace-analysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hammerlab / grafana-spark-dashboards
View on GitHub
Scripts for generating Grafana dashboards for monitoring Spark jobs
☆239Mar 26, 2015Updated 11 years ago
CODAIT / spark-bench
View on GitHub
Benchmark Suite for Apache Spark
☆242Apr 12, 2023Updated 3 years ago
baiwei0427 / PIAS
View on GitHub
Information-Agnostic Flow Scheduling for Commodity Data Centers
☆16Jul 20, 2016Updated 10 years ago
coflow / varys
View on GitHub
Varys: Efficient Clairvoyant Coflow Scheduler
☆36Aug 6, 2015Updated 10 years ago
databricks / spark-perf
View on GitHub
Performance tests for Apache Spark
☆392Jul 9, 2018Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
databricks / simr
View on GitHub
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
☆44Mar 9, 2022Updated 4 years ago
eastcirclek / terasort
View on GitHub
TeraSort for Spark and Flink which uses a range partitioner based on sampling
☆22Feb 5, 2016Updated 10 years ago
mhausenblas / elsa
View on GitHub
Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)
☆35Mar 16, 2015Updated 11 years ago
brightcove-archive / ooyala_spark-jobserver
View on GitHub
REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…
☆345May 19, 2017Updated 9 years ago
tresata / spark-kafka
View on GitHub
Low level integration of Spark and Kafka
☆129Mar 15, 2018Updated 8 years ago
coflow / aalo
View on GitHub
Aalo: Efficient Non-Clairvoyant Coflow Scheduler
☆13Nov 22, 2015Updated 10 years ago
broxtronix / spark-gce
View on GitHub
A tool for running Spark on Google Compute Engine
☆16Jan 20, 2017Updated 9 years ago
darkjh / scalaflow
View on GitHub
Fluent Scala DSL for Google's Cloud Dataflow SDK
☆56Aug 2, 2015Updated 10 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
stealthly / punxsutawney
View on GitHub
An Apache Mesos Framework that allows for replaying load over and over and over (and over) again
☆10Aug 10, 2015Updated 10 years ago
amplab / spark-indexedrdd
View on GitHub
An efficient updatable key-value store for Apache Spark
☆255Mar 11, 2017Updated 9 years ago
fawind / picasso
View on GitHub
Artsy New Tab pages for the masses 🎨
☆11May 25, 2022Updated 4 years ago
radlab / sparrow
View on GitHub
Sparrow scheduling platform (U.C. Berkeley).
☆328Jul 25, 2020Updated 6 years ago
adobe-research / spark-gpu
View on GitHub
GPU Acceleration for Apache Spark
☆34Aug 24, 2015Updated 10 years ago
bythebay / pipeline
View on GitHub
Complete Pipeline Training at Big Data Scala By the Bay
☆71Oct 27, 2015Updated 10 years ago
lightbend / mesos-spark-integration-tests
View on GitHub
Mesos Integration Tests on Docker/Ec2
☆15May 25, 2023Updated 3 years ago
uwsampa / grappa
View on GitHub
Grappa: scaling irregular applications on commodity clusters
☆159May 4, 2017Updated 9 years ago
tresata / spark-columnar
View on GitHub
☆15Mar 4, 2015Updated 11 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
amplab / benchmark
View on GitHub
Large scale query engine benchmark
☆99Apr 5, 2016Updated 10 years ago
Intel-bigdata / HiBench
View on GitHub
HiBench is a big data benchmark suite.
☆1,484Dec 15, 2025Updated 7 months ago
TrueCar / mleap
View on GitHub
MLeap allows for easily putting Spark ML pipelines into production
☆78Oct 27, 2016Updated 9 years ago
databricks / spark-csv
View on GitHub
CSV Data Source for Apache Spark 1.x
☆1,057Dec 13, 2018Updated 7 years ago
ehiggs / spark-terasort
View on GitHub
Spark Terasort
☆121Apr 21, 2023Updated 3 years ago
abduld / WebGPU
View on GitHub
☆21Dec 15, 2023Updated 2 years ago
IBMStreams / benchmarks
View on GitHub
Contains performance benchmark applications for IBM Streams
☆13Jul 6, 2025Updated last year
databricks / spark-sql-perf
View on GitHub
☆623Feb 26, 2022Updated 4 years ago
google / cluster-scheduler-simulator
View on GitHub
Automatically exported from code.google.com/p/cluster-scheduler-simulator
☆173Jun 3, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
syoummer / SpatialSpark
View on GitHub
Big Spatial Data Processing using Spark
☆146Mar 7, 2017Updated 9 years ago
lihaoyi / scala-bench
View on GitHub
Some benchmarks of memory and runtime performance of Scala's collections
☆44May 19, 2024Updated 2 years ago
PathDump / PathDump
View on GitHub
Implementation based on OSDI paper
☆20Feb 11, 2018Updated 8 years ago
ankurdave / kmeans-spark
View on GitHub
A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark.
☆26Apr 9, 2011Updated 15 years ago
spark-notebook / spark-notebook
View on GitHub
Interactive and Reactive Data Science using Scala and Spark.
☆3,142May 16, 2023Updated 3 years ago
stephenneuendorffer / vyasa
View on GitHub
Xilinx Modifications to Halide
☆13May 3, 2021Updated 5 years ago
ZEPL / z-manager
View on GitHub
Simplify getting Zeppelin up and running
☆56Jul 20, 2016Updated 10 years ago