palantir / spark-tpcds-benchmarkLinks
Utility for benchmarking changes in Spark using TPC-DS workloads
☆16Updated 4 years ago
Alternatives and similar repositories for spark-tpcds-benchmark
Users that are interested in spark-tpcds-benchmark are comparing it to the libraries listed below
Sorting:
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Framework for running macro benchmarks in a clustered environment☆25Updated 2 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated 2 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Updated 8 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.☆116Updated 10 months ago
- StreamLine - Streaming Analytics☆164Updated last year
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆55Updated 3 years ago
- Hadoop utility to compact small files☆18Updated last year
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆127Updated 6 months ago
- Flink performance tests☆28Updated 5 years ago
- Quark is a data virtualization engine over analytic databases.☆98Updated 7 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Spark Structured Streaming State Tools☆34Updated 4 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Updated 5 years ago
- Thoughts on things I find interesting.☆17Updated 6 months ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Updated 2 years ago
- All the things about TPC-DS in Apache Spark☆106Updated 2 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 3 years ago
- ☆39Updated 6 years ago
- Stratosphere is now Apache Flink.☆197Updated last year
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated last year
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50Updated 9 years ago
- A tool to get better debug info on spark's memory usage☆42Updated 5 years ago
- Demo quering counts of a event stream with Apache Flink☆23Updated 6 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 7 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC☆40Updated 8 months ago
- Java event logs collector for hadoop and frameworks☆40Updated 3 months ago