sigmoidanalytics/spark_gce

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sigmoidanalytics/spark_gce)

sigmoidanalytics / spark_gce

Spark GCE Script Helps you deploy Spark cluster on Google Cloud.

☆43

Alternatives and similar repositories for spark_gce

Users that are interested in spark_gce are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

broxtronix / spark-gce
View on GitHub
A tool for running Spark on Google Compute Engine
☆16Jan 20, 2017Updated 9 years ago
mesos / spark-ec2
View on GitHub
[NOTE: Repository has moved to github.com/amplab/spark-ec2]
☆57Aug 10, 2015Updated 10 years ago
GoogleCloudPlatform / solutions-google-compute-engine-cluster-for-hadoop
View on GitHub
This sample app will get up and running quickly with a Hadoop cluster on Google Compute Engine. For more information on running Hadoop o…
☆81Jan 9, 2018Updated 8 years ago
AndreSchumacher / avro-parquet-spark-example
View on GitHub
An example of using Avro and Parquet in Spark SQL
☆60Nov 16, 2015Updated 10 years ago
bigdatabe / p2
View on GitHub
The second BigDate.be workshop
☆18Sep 4, 2013Updated 12 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
med-at-scale / high-health
View on GitHub
Integrate the GA4GH schemas and probably a scala impl of the service.
☆14May 20, 2016Updated 10 years ago
pomadchin / accumulo-spark
View on GitHub
Docker containers with Apache Accumulo and Apache Spark environment.
☆12Jan 22, 2016Updated 10 years ago
mandubian / zpark-ztream
View on GitHub
Driving Spark stream with Scalaz-Stream
☆26Mar 18, 2014Updated 12 years ago
lightning-viz / lightning-scala
View on GitHub
Scala client for the Lightning data visualization server (WIP)
☆47Jun 25, 2019Updated 7 years ago
elodina / syscol
View on GitHub
Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka
☆20Jan 29, 2016Updated 10 years ago
tresata / spark-columnar
View on GitHub
☆15Mar 4, 2015Updated 11 years ago
FurongHuang / spectrallda-tensorspark
View on GitHub
Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark.
☆104Jul 2, 2018Updated 8 years ago
pwendell / spark-twitter-collection
View on GitHub
Spark example of collecting tweets and loading into HDFS/S3
☆42Oct 2, 2013Updated 12 years ago
amplab / docker-scripts
View on GitHub
Dockerfiles and scripts for Spark and Shark Docker images
☆259Jun 19, 2014Updated 12 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
exoscale / python-riemann-wrapper
View on GitHub
time and report exception in riemann for functions
☆20Apr 17, 2016Updated 10 years ago
collectivemedia / modelmatrix
View on GitHub
Sparse feature extraction with Spark
☆30Jul 25, 2018Updated 8 years ago
MLWave / Kaggle_Rotten_Tomatoes
View on GitHub
Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit
☆24Jun 22, 2014Updated 12 years ago
hohonuuli / sparknotebook
View on GitHub
An example of running Apache Spark using Scala in ipython notebook
☆141Aug 31, 2015Updated 10 years ago
shivaram / spark-ec2
View on GitHub
Scripts used to setup a Spark cluster on EC2
☆21Mar 24, 2016Updated 10 years ago
suhailshergill / TTFI
View on GitHub
typed tagless final interpreters
☆13Feb 14, 2017Updated 9 years ago
amplab / velox-modelserver
View on GitHub
☆110Apr 17, 2017Updated 9 years ago
ccsevers / scalding-linalg
View on GitHub
Linear algebra routines for Scalding.
☆21May 23, 2013Updated 13 years ago
amplab / training
View on GitHub
Training materials for Strata, AMP Camp, etc
☆150Nov 20, 2015Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bigdatagenomics / bdg-formats
View on GitHub
Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.
☆42Feb 13, 2026Updated 5 months ago
clamm / spark-location-history
View on GitHub
Application that visualizes your google location history in form of a heatmap using Spark to aggregate the data.
☆12Feb 19, 2015Updated 11 years ago
mitll / vizlinc
View on GitHub
Vizlinc
☆15Jan 14, 2016Updated 10 years ago
freeman-lab / spark-ml-streaming
View on GitHub
Visualize streaming machine learning in Spark
☆176Jun 29, 2017Updated 9 years ago
databricks / sbt-databricks
View on GitHub
An sbt plugin for deploying code to Databricks Cloud
☆71Jul 8, 2018Updated 8 years ago
bigdatagenomics / avocado
View on GitHub
A Variant Caller, Distributed. Apache 2 licensed.
☆72Mar 11, 2019Updated 7 years ago
massie / spark-parquet-example
View on GitHub
Example project to show how to use Spark to read and write Avro/Parquet files
☆50Aug 21, 2013Updated 12 years ago
googlegenomics / spark-examples
View on GitHub
Apache Spark jobs such as Principal Coordinate Analysis.
☆77Jan 30, 2017Updated 9 years ago
bigdatagenomics / eggo
View on GitHub
Ready-to-go Parquet-formatted public 'omics datasets
☆30Nov 2, 2015Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
collectivemedia / spark-ext
View on GitHub
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
☆145Jan 26, 2016Updated 10 years ago
daithiocrualaoich / spark-emr
View on GitHub
Spark Elastic MapReduce bootstrap and runnable examples.
☆17Jun 26, 2013Updated 13 years ago
darkjh / scalaflow
View on GitHub
Fluent Scala DSL for Google's Cloud Dataflow SDK
☆56Aug 2, 2015Updated 10 years ago
bbc / rdfsim
View on GitHub
Large RDF hierarchies as vector spaces
☆20Jun 27, 2014Updated 12 years ago
kingdonb / kccnceu2021
View on GitHub
KubeCon CloudNativeCon EU 2021
☆12May 20, 2021Updated 5 years ago
solid-contrib / talks
View on GitHub
List of Solid talks
☆17Nov 25, 2019Updated 6 years ago
johnmq / raft-rs
View on GitHub
Raft implementation in Rust
☆26Jan 8, 2015Updated 11 years ago