nchammas / flintrock
A command-line tool for launching Apache Spark clusters.
☆637Updated 2 months ago
Related projects: ⓘ
- Scripts used to setup a Spark cluster on EC2☆392Updated 6 years ago
- Redshift data source for Apache Spark☆605Updated last year
- Mirror of Apache Toree (Incubating)☆737Updated 2 weeks ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Updated 7 years ago
- This repository hold the Amazon Elastic MapReduce sample bootstrap actions☆614Updated last year
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆692Updated last month
- Base classes to use when writing tests with Spark☆1,509Updated 2 months ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,315Updated last month
- ☆510Updated 2 years ago
- ☆244Updated 4 years ago
- The Internals of Apache Spark☆1,461Updated this week
- CSV Data Source for Apache Spark 1.x☆1,053Updated 5 years ago
- Essential Spark extensions and helper methods ✨😲☆747Updated 2 years ago
- Avro Data Source for Apache Spark☆539Updated 5 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆429Updated this week
- Performance tests for Apache Spark☆379Updated 6 years ago
- Stanford CoreNLP wrapper for Apache Spark☆422Updated 5 years ago
- A tool for monitoring and tuning Spark jobs for efficiency.☆357Updated last year
- Apache Spark on AWS Lambda☆151Updated last year
- [DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark☆749Updated last month
- XML data source for Spark SQL and DataFrames☆500Updated last month
- Examples for High Performance Spark☆497Updated 3 weeks ago
- Iceberg is a table format for large, slow-moving tabular data☆476Updated last year
- Serverless proxy for Spark cluster☆326Updated 3 years ago
- The missing MatPlotLib for Scala + Spark☆730Updated 2 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Updated last year
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 4 years ago
- Kinesis Connector for Structured Streaming☆137Updated 2 months ago
- VM based deployment for prototyping Big Data tools on Amazon Web Services☆128Updated 4 years ago
- Performant Redshift data source for Apache Spark☆135Updated last month