A tool for running Spark on Google Compute Engine
☆16Jan 20, 2017Updated 9 years ago
Alternatives and similar repositories for spark-gce
Users that are interested in spark-gce are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stream timeseries data to a file☆17Aug 14, 2016Updated 9 years ago
- personal redirect server☆17Aug 11, 2016Updated 9 years ago
- Mesos Integration Tests on Docker/Ec2☆15May 25, 2023Updated 2 years ago
- Demonstrates the pros and cons of scala.Enumeration and examines alternative structures☆18Nov 24, 2016Updated 9 years ago
- On demand presto cluster with mesos, marathon and docker.☆29Mar 7, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A command-line tool for launching Apache Spark clusters.☆652Dec 13, 2024Updated last year
- Apache Spark build compatible with AWS Glue Data Catalog.☆19Aug 9, 2021Updated 4 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- Comparison of single-cell normalization methods using multiple datasets with ground-truth labels☆19Dec 2, 2019Updated 6 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- ☆19Dec 16, 2022Updated 3 years ago
- a bare-bones development server for watchify☆13Aug 23, 2015Updated 10 years ago
- experiments with http://regl.party☆11Nov 22, 2016Updated 9 years ago
- list of 2D and 3D mesh modules☆14Nov 28, 2015Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- algorithms for mass univariate regression☆13Aug 21, 2018Updated 7 years ago
- Interpret a string literal at the beginning of a function as its documentation.☆16Aug 15, 2016Updated 9 years ago
- A parser/exporter for the greatest image format ever created.☆12Apr 18, 2016Updated 10 years ago
- Playbook to provision a Confluent Cluster☆10Oct 22, 2017Updated 8 years ago
- Find the unique columns in a tabular dataset.☆13Jan 13, 2016Updated 10 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Oct 27, 2015Updated 10 years ago
- Notebooks to accompany sofroniew-vlasov-2015☆10May 18, 2019Updated 6 years ago
- Embedded Kafka for testing and quick prototyping.☆14Apr 19, 2016Updated 10 years ago
- A framework for systematically quality controlling big data.☆40Mar 13, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Ansible role to install/configure Prometheus☆12Mar 21, 2026Updated last month
- Data Science with Apache Spark and Spark Notebook☆30Jul 24, 2017Updated 8 years ago
- The Bible has some issues. Let's make it better.☆72Oct 2, 2015Updated 10 years ago
- Protobuf support for Finagle☆14Nov 7, 2022Updated 3 years ago
- Docker based Netflix proxy☆18Sep 6, 2014Updated 11 years ago
- Implementation of 'Recordinality' cardinality estimation sketch with distinct value sampling☆55Aug 20, 2013Updated 12 years ago
- Experiments with scala native & libpcap☆10Mar 30, 2018Updated 8 years ago
- a world of hexagons☆12Jan 19, 2016Updated 10 years ago
- Seldon Spark Jobs☆26Apr 11, 2015Updated 11 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains the source for a json-json transformation processor for apache NiFi☆12Jun 21, 2015Updated 10 years ago
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- Hadoop Yarn aggregated log parser utility☆23Feb 1, 2020Updated 6 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Jul 21, 2023Updated 2 years ago
- WSLKit is a generic toolkit for Windows Subsystem for Linux (WSL), with a PowerShell API, and support for VPN-friendly networking kit (VP…☆21Apr 23, 2026Updated last week
- Quick benchmark comparing Protocol Buffers 3 vs Jackson JSON☆14Jul 3, 2015Updated 10 years ago
- Aggregate and store a collection of data for GitHub repositories, intended for use with documenting package ecosystems on npm☆19Mar 27, 2016Updated 10 years ago