kiszk / spark-gpu
Spark GPU and SIMD Support
☆61 Updated 5 years ago
Alternatives and similar repositories for spark-gpu
Users who are interested in spark-gpu are comparing it to the libraries listed below.
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk ☆171 Updated 7 years ago
- Drizzle integration with Apache Spark ☆120 Updated 6 years ago
- Cascading on Apache Flink® ☆54 Updated last year
- something to help you spark ☆64 Updated 6 years ago
- Joins for skewed datasets in Spark ☆57 Updated 8 years ago
- An efficient updatable key-value store for Apache Spark ☆252 Updated 8 years ago
- ☆92 Updated 8 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 Updated 8 years ago
- JVM integration for Weld ☆16 Updated 6 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine ☆167 Updated 4 years ago
- ☆33 Updated 9 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 Updated 8 years ago
- Secondary sort and streaming reduce for Apache Spark ☆78 Updated 2 years ago
- Support Highcharts in Apache Zeppelin ☆81 Updated 7 years ago
- Scripts to analyze Spark's performance ☆136 Updated 7 years ago
- Interactive Audience Analytics with Spark and HyperLogLog ☆55 Updated 9 years ago
- Library for organizing batch processing pipelines in Apache Spark ☆42 Updated 8 years ago
- Example Spark project using Parquet as a columnar store with Thrift objects. ☆48 Updated 11 years ago
- Scala bindings for Bokeh plotting library ☆136 Updated last year
- Spark MLlib code optimized to efficiently support sparse data ☆51 Updated 8 years ago
- GPU* or SPARK* branches are used for generating GPU code in Tungsten / contact: @kiszk; MLlib branch is used for CUDA-MLlib project / contact: … ☆48 Updated 8 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 Updated 9 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files ☆50 Updated 12 years ago
- functionstest ☆33 Updated 8 years ago
- Bucketing and partitioning system for Parquet ☆30 Updated 7 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark. ☆28 Updated 8 years ago
- A quotation-based Scala DSL for scalable data analysis. ☆63 Updated 3 years ago
- Simple Spark app that reads and writes Avro data ☆31 Updated 10 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support… ☆109 Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆52 Updated 11 years ago