shivaram / spark-ec2Links
Scripts used to setup a Spark cluster on EC2
☆21Updated 9 years ago
Alternatives and similar repositories for spark-ec2
Users that are interested in spark-ec2 are comparing it to the libraries listed below
Sorting:
- Scripts to analyze Spark's performance☆136Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Testbench for experimenting with Apache Hive at any data scale.☆64Updated 8 years ago
- Spark Terasort☆121Updated 2 years ago
- Fast I/O plugins for Spark☆41Updated 4 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 12 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- Parquet file generator☆22Updated 7 years ago
- An extension of Yahoo's Benchmarks☆108Updated last year
- Drizzle integration with Apache Spark☆120Updated 7 years ago
- Benchmark Suite for Apache Spark☆241Updated 2 years ago
- Large scale query engine benchmark☆99Updated 9 years ago
- Spark GPU and SIMD Support☆61Updated 5 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Spark Tutorial at the University of Maryland☆38Updated 11 years ago
- A connector for SingleStore and Spark☆162Updated last month
- ☆21Updated 10 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50Updated 9 years ago
- Self-written notes that may be useful☆106Updated 9 years ago
- real time log event processing using spark, kafka & cassandra☆13Updated 10 years ago
- Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.☆114Updated last year
- Source code of Blog at☆51Updated last month
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 10 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆108Updated 7 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- ☆56Updated 11 years ago
- [NOTE: Repository has moved to github.com/amplab/spark-ec2]☆57Updated 10 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Utility to easily copy files into HDFS☆69Updated 5 years ago