krisnova / spark-cluster-api-operatorLinks
Use Kubernetes to autoscale your spark clusters.
☆10Updated 6 years ago
Alternatives and similar repositories for spark-cluster-api-operator
Users that are interested in spark-cluster-api-operator are comparing it to the libraries listed below
Sorting:
- A tutorial on Apache Spark Unit Testing☆37Updated 10 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 5 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Updated last year
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- Google BigQuery support for Spark, SQL, and DataFrames☆156Updated 6 years ago
- A stack overflow for Apache Spark☆72Updated 8 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- Apache Spark OpenCPU Executor (ROSE)☆26Updated 7 years ago
- Cheatsheet for Spark DataFrame☆91Updated 6 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆97Updated 2 weeks ago
- Scala for Statistical Computing and Data Science Short Course☆136Updated 5 years ago
- Code from the book Machine Learning Systems☆145Updated 7 years ago
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 7 years ago
- ☆19Updated 2 years ago
- A full example of my blog post regarding Sparks stateful streaming (http://asyncified.io/2016/07/31/exploring-stateful-streaming-with-apa…☆35Updated 8 years ago
- A giter8 template for Spark SBT projects☆72Updated 4 years ago
- ☆54Updated 8 years ago
- Implementations of the Portable Format for Analytics (PFA)☆126Updated 3 years ago
- Base hadoop/spark/bigdata image with advanced config loading scripts.☆11Updated 5 years ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Updated 6 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- spark backend for dplyr☆48Updated 10 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆134Updated 3 years ago
- Live-updating Spark UI built with Meteor☆189Updated 4 years ago
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆296Updated last year
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 7 years ago
- An example PySpark project with pytest☆18Updated 8 years ago
- An umbrella project for multiple implementations of model serving☆45Updated 8 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆115Updated 5 years ago