pinterest/terrapin

Serving system for batch generated data sets

☆179

Alternatives and similar repositories for terrapin

Users that are interested in terrapin are comparing it to the libraries listed below.

elodina / syscol
Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka
☆20 · Jan 29, 2016 · Updated 10 years ago
maropu / hivemall-spark
A Hivemall wrapper for Spark
☆31 · Apr 21, 2016 · Updated 10 years ago
Banno / samza-mesos
This project allows to run Samza jobs on Mesos cluster
☆43 · Mar 25, 2021 · Updated 5 years ago
stealthly / punxsutawney
An Apache Mesos Framework that allows for replaying load over and over and over (and over) again
☆10 · Aug 10, 2015 · Updated 10 years ago
pinterest / pinball
Pinball is a scalable workflow manager
☆1,047 · Dec 10, 2019 · Updated 6 years ago
tweetmagik / spark-yarn
Launch Spark clusters on YARN
☆24 · Aug 29, 2011 · Updated 14 years ago
CiscoCloud / exhibitor-mesos-framework
Exhibitor on Apache Mesos for reliably running Zookeeper on Mesos
☆20 · May 18, 2016 · Updated 10 years ago
theclaymethod / Foundry-vagrant-mesos-kafka-cluster
A Vagrant/Ansible => Kafka, Mesos (w/ Marathon/Docker), ZK, Hadoop, and Spark. Service discovery via HAProxy and Bamboo.
☆50 · Dec 3, 2014 · Updated 11 years ago
mesos / myriad
https://github.com/apache/incubator-myriad is our new home. See
☆251 · Dec 2, 2015 · Updated 10 years ago
concord / concord-jvm
Java and Scala client libraries for Concord
☆13 · Feb 15, 2017 · Updated 9 years ago
TAwarehouse / backup-hadoop-and-hive
☆21 · May 9, 2012 · Updated 14 years ago
stripe-archive / herringbone
Tools for working with parquet, impala, and hive
☆135 · Jan 4, 2021 · Updated 5 years ago
rjurney / Cloud-Stenography
Main Repo
☆15 · Jun 24, 2010 · Updated 16 years ago
khronus / khronus
A reactive time series database
☆234 · Jun 5, 2018 · Updated 8 years ago
twitter-archive / mysos
Cotton (formerly known as Mysos)
☆584 · Aug 6, 2015 · Updated 10 years ago
pinterest / kingpin
KingPin is the toolset used at Pinterest for service discovery and application configuration.
☆70 · Nov 16, 2018 · Updated 7 years ago
Hydrospheredata / mist
Serverless proxy for Spark cluster
☆325 · Apr 13, 2026 · Updated 3 months ago
sheepkiller / presto-marathon-docker
On demand presto cluster with mesos, marathon and docker.
☆29 · Mar 7, 2018 · Updated 8 years ago
twitter / storehaus
Storehaus is a library that makes it easy to work with asynchronous key value stores
☆465 · Jul 17, 2020 · Updated 6 years ago
etsy / jading
cascading.jruby build and execution tool
☆16 · Sep 23, 2015 · Updated 10 years ago
apache / gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270 · Jun 24, 2026 · Updated last month
nitsanw / safepoint-experiments
☆11 · Feb 24, 2016 · Updated 10 years ago
adobe-research / spindle
Next-generation web analytics processing with Scala, Spark, and Parquet.
☆330 · Mar 28, 2015 · Updated 11 years ago
ianoc / SparkEMRBootstrap
Files to help make new spark EMR Bootstraps
☆15 · Aug 4, 2013 · Updated 12 years ago
thunderain-project / thunderain
A Real-Time Analytical Processing (RTAP) example using Spark/Shark
☆51 · Feb 21, 2014 · Updated 12 years ago
sentric / hannibal
Hannibal is tool to help monitor and maintain HBase-Clusters that are configured for manual splitting.
☆172 · Dec 22, 2017 · Updated 8 years ago
stripe-archive / timberlake
Timberlake is a Job Tracker for Hadoop.
☆177 · Jan 24, 2020 · Updated 6 years ago
rbrush / kite-apps
Prescriptive Applications over Kite and Hadoop
☆12 · Oct 14, 2015 · Updated 10 years ago
prezi / datateam
Data team culture
☆42 · Sep 10, 2015 · Updated 10 years ago
crowdmob / kafka-s3-consumer
Consumes Kafka topics specified in the config, and outputs them in chunks as desired in an S3 Bucket. Keeps track of offsets via S3.
☆15 · Sep 6, 2013 · Updated 12 years ago
agentultra / dmon
A stream-processing service for monitoring distributed clusters
☆15 · Feb 6, 2015 · Updated 11 years ago
Verizon / funnel
DEPRECATED: Reasonable monitoring for distributed systems
☆145 · Mar 1, 2017 · Updated 9 years ago
ContainerSolutions / minimesos
The experimentation and testing tool for Apache Mesos - NO LONGER MAINTANED!
☆425 · Jun 22, 2018 · Updated 8 years ago
yieldbot / chronos-shuttle
An opinionated CLI for Chronos
☆22 · Oct 25, 2018 · Updated 7 years ago
calrissian / spark-jetty-server
Recipes and examples for Apache Spark
☆13 · Jan 21, 2015 · Updated 11 years ago
sameeragarwal / blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
☆660 · Feb 6, 2014 · Updated 12 years ago
brightcove-archive / ooyala_spark-jobserver
REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…
☆345 · May 19, 2017 · Updated 9 years ago
Comcast / sirius
A distributed system library for managing application reference data
☆296 · Feb 28, 2025 · Updated last year
GravityLabs / HPaste
HBase DSL for Scala with MapReduce support
☆128 · Jan 4, 2018 · Updated 8 years ago