palantir / spark-tpcds-benchmark
Utility for benchmarking changes in Spark using TPC-DS workloads
☆16Updated 3 years ago
Alternatives and similar repositories for spark-tpcds-benchmark
Users that are interested in spark-tpcds-benchmark are comparing it to the libraries listed below
Sorting:
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 3 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 5 months ago
- A tool to get better debug info on spark's memory usage☆42Updated 5 years ago
- ☆39Updated 6 years ago
- Spark Connector to read and write with Pulsar☆113Updated 6 months ago
- Thoughts on things I find interesting.☆17Updated 4 months ago
- Quark is a data virtualization engine over analytic databases.☆97Updated 7 years ago
- Mirror of Apache Tephra (Incubating)☆32Updated 2 years ago
- Java event logs collector for hadoop and frameworks☆39Updated last month
- Spark Shuffle Optimization with RDMA+AEP☆30Updated last year
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated 2 years ago
- Splittable Gzip codec for Hadoop☆70Updated 3 weeks ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated last year
- Druid indexing plugin for using Spark in batch jobs☆101Updated 3 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 7 years ago
- Spark Structured Streaming State Tools☆34Updated 4 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆55Updated 3 years ago
- Generic Model Serving Implementation leveraging Flink☆19Updated 6 years ago
- Plugin for Presto to allow addition of user functions easily☆118Updated 4 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 7 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- LinkedIn's version of Apache Calcite☆22Updated 6 months ago
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- Sample UDF and UDAs for Impala.☆64Updated 5 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Updated 5 years ago
- Run TPCH Benchmark on Apache Kylin☆22Updated 3 years ago