kiszk / spark-gpuLinks
Spark GPU and SIMD Support
☆61Updated 4 years ago
Alternatives and similar repositories for spark-gpu
Users that are interested in spark-gpu are comparing it to the libraries listed below
Sorting:
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk☆171Updated 6 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 7 years ago
- JVM integration for Weld☆16Updated 6 years ago
- Fast I/O plugins for Spark☆41Updated 4 years ago
- GPU* or SPARK* branches are used for generating GPU code in Tungsten/concact:@kiszk, MLlib branch is used for CUDA-MLlib project/concact:…☆48Updated 8 years ago
- Joins for skewed datasets in Spark☆57Updated 7 years ago
- Spark Terasort☆121Updated 2 years ago
- Cascading on Apache Flink®☆54Updated last year
- Scripts to analyze Spark's performance☆136Updated 7 years ago
- Automatic offload of user-written Spark kernels to accelerators☆18Updated 8 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Updated last year
- An efficient updatable key-value store for Apache Spark☆251Updated 8 years ago
- Example Spark project using Parquet as a columnar store with Thrift objects.☆48Updated 10 years ago
- An extension of Yahoo's Benchmarks☆107Updated last year
- something to help you spark☆65Updated 6 years ago
- Fast, memory-efficient, minimal-serialization, binary data vectors for Scala and other languages☆67Updated 7 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Support for operating on images via Apache Spark☆26Updated 2 years ago
- An experimental Graph Streaming API for Apache Flink☆142Updated 4 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- A tool to get better debug info on spark's memory usage☆42Updated 5 years ago
- Scala bindings for Bokeh plotting library☆136Updated last year
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms.☆27Updated 2 years ago
- SQL parser written using Scala's parser combinator library☆103Updated 9 years ago
- A quotation-based Scala DSL for scalable data analysis.☆63Updated 2 years ago
- Mirror of Apache Spark☆56Updated 9 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 8 years ago