Performance Analysis Tool
☆78 · Nov 25, 2025 · Updated 3 months ago
Alternatives and similar repositories for PAT
Users that are interested in PAT are comparing it to the libraries listed below
- ☆11 · Nov 16, 2022 · Updated 3 years ago
- HiBench is a big data benchmark suite. ☆1,489 · Dec 15, 2025 · Updated 2 months ago
- Spark* shuffle plugin that supports shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis… ☆21 · Mar 15, 2024 · Updated last year
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange ☆130 · Dec 19, 2024 · Updated last year
- Copy millions of objects in minutes ☆12 · Oct 21, 2019 · Updated 6 years ago
- Configure an LDAPS Endpoint for Simple AD ☆14 · Aug 29, 2017 · Updated 8 years ago
- python script to repair the primary range of a node in N discrete steps ☆12 · Aug 3, 2018 · Updated 7 years ago
- An AWS Lambda package including two functions to dynamically maintain a security partition around a group of AWS resources which originat… ☆12 · Nov 16, 2018 · Updated 7 years ago
- FOundation of stXXl and thriLL ☆14 · Jan 24, 2024 · Updated 2 years ago
- Conway's Game of Life implemented in Scala.js ☆10 · Mar 30, 2018 · Updated 7 years ago
- pysh-db - The Data Science Toolkit (DSK) ☆13 · Dec 5, 2018 · Updated 7 years ago
- Mirror of Apache crail (Incubating) ☆151 · Jul 3, 2022 · Updated 3 years ago
- The presentation at Spark Summit 2014 showing how 4Quant does production scale image processing and analysis using Spark ☆16 · Jul 29, 2014 · Updated 11 years ago
- Additional useful algorithms that can be used with spark. ☆24 · Dec 24, 2014 · Updated 11 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid… ☆257 · May 13, 2019 · Updated 6 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode. ☆134 · Jan 11, 2024 · Updated 2 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution ☆140 · Jan 3, 2023 · Updated 3 years ago
- Spark Shuffle Optimization with RDMA+AEP ☆30 · May 23, 2023 · Updated 2 years ago
- Chatlytics is a data query and visualization platform for chat! ☆13 · Feb 21, 2017 · Updated 9 years ago
- Presentation materials for the 2016 Berkeley C++ Summit ☆14 · Oct 20, 2016 · Updated 9 years ago
- A low-overhead sampling profiler for PySpark, that outputs Flame Graphs ☆16 · Dec 17, 2020 · Updated 5 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o… ☆21 · Feb 10, 2026 · Updated 3 weeks ago
- Benchmark Suite for Apache Spark ☆240 · Apr 12, 2023 · Updated 2 years ago
- Java event logs collector for hadoop and frameworks ☆41 · Mar 25, 2025 · Updated 11 months ago
- Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari. ☆14 · Jan 23, 2016 · Updated 10 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status ☆35 · Feb 12, 2019 · Updated 7 years ago
- TPC-CLANG compiler that compiles a TPC C programming language which is used in HabanaLabs Deep-Learning Accelerators ☆26 · Nov 11, 2024 · Updated last year
- The released version of Astro(Spark SQL on HBase) has been moved to: ☆16 · Jul 23, 2015 · Updated 10 years ago
- Hadoop Data Pipeline using Falcon ☆15 · May 3, 2016 · Updated 9 years ago
- Caffe deep learning framework - optimized for Xeon Phi ☆14 · May 12, 2015 · Updated 10 years ago
- Litesimd is a no overhead, header only, C++ library for SIMD processing, specialized on SIMD comparison and data shuffle. ☆16 · May 23, 2019 · Updated 6 years ago
- spark structured streaming via HTTP communication ☆18 · Jul 7, 2022 · Updated 3 years ago
- Paper: A Zero-rename committer for object stores ☆20 · Nov 7, 2025 · Updated 3 months ago
- Visualize your HDFS cluster usage ☆228 · Oct 13, 2020 · Updated 5 years ago
- Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN. ☆128 · Sep 7, 2018 · Updated 7 years ago
- OpenMP vs Offload ☆23 · Jun 2, 2023 · Updated 2 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has migrated to Apa… ☆183 · Apr 6, 2022 · Updated 3 years ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers. ☆335 · Sep 29, 2023 · Updated 2 years ago
- Open source framework for predictive modeling on Apache Hadoop ☆34 · Aug 23, 2014 · Updated 11 years ago