Serving system for batch generated data sets
☆177 · May 11, 2017 · Updated 8 years ago
Alternatives and similar repositories for terrapin
Users that are interested in terrapin are comparing it to the libraries listed below
- Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka ☆20 · Jan 29, 2016 · Updated 10 years ago
- A Hivemall wrapper for Spark ☆31 · Apr 21, 2016 · Updated 9 years ago
- This project allows running Samza jobs on a Mesos cluster ☆43 · Mar 25, 2021 · Updated 4 years ago
- Pinball is a scalable workflow manager ☆1,043 · Dec 10, 2019 · Updated 6 years ago
- A Ruby toolkit for cloud-friendly ETL ☆38 · Jul 29, 2016 · Updated 9 years ago
- Java and Scala client libraries for Concord ☆13 · Feb 15, 2017 · Updated 9 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again ☆10 · Aug 10, 2015 · Updated 10 years ago
- PinLater is a Thrift service to manage scheduling and execution of asynchronous jobs. ☆140 · May 12, 2017 · Updated 8 years ago
- https://github.com/apache/incubator-myriad is our new home. See ☆252 · Dec 2, 2015 · Updated 10 years ago
- An opinionated CLI for Chronos ☆22 · Oct 25, 2018 · Updated 7 years ago
- Serverless proxy for Spark cluster ☆325 · Oct 29, 2020 · Updated 5 years ago
- The experimentation and testing tool for Apache Mesos - NO LONGER MAINTAINED! ☆424 · Jun 22, 2018 · Updated 7 years ago
- Cotton (formerly known as Mysos) ☆585 · Aug 6, 2015 · Updated 10 years ago
- Exhibitor on Apache Mesos for reliably running Zookeeper on Mesos ☆20 · May 18, 2016 · Updated 9 years ago
- Alenka JDBC is a library for accessing and manipulating data with the open-source GPU database Alenka. ☆20 · Jul 3, 2014 · Updated 11 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,261 · Updated this week
- The backend for the Pathfinder service. ☆12 · May 25, 2016 · Updated 9 years ago
- Realtime analytics, this includes the core components of Pulsar pipeline. ☆651 · Nov 6, 2015 · Updated 10 years ago
- KingPin is the toolset used at Pinterest for service discovery and application configuration. ☆69 · Nov 16, 2018 · Updated 7 years ago
- Recipes and examples for Apache Spark ☆13 · Jan 21, 2015 · Updated 11 years ago
- Satellite monitors, alerts on, and self-heals your Mesos cluster. ☆144 · May 9, 2016 · Updated 9 years ago
- Extensible Scheduler for Mesos Frameworks ☆698 · Mar 31, 2023 · Updated 2 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store ☆17 · Oct 20, 2022 · Updated 3 years ago
- A high performance replicated log service. (The development is moved to Apache Incubator) ☆2,208 · Feb 25, 2020 · Updated 6 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers. ☆30 · Jul 17, 2015 · Updated 10 years ago
- ☆92 · Apr 17, 2017 · Updated 8 years ago
- Quark is a data virtualization engine over analytic databases. ☆100 · Jul 13, 2017 · Updated 8 years ago
- Coral is a real-time analytics and data science platform. It transforms streaming events and extracts patterns from data via RESTful APIs.… ☆147 · Sep 5, 2019 · Updated 6 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 · Dec 14, 2022 · Updated 3 years ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website: ☆17 · Mar 3, 2016 · Updated 10 years ago
- Secor is a service implementing Kafka log persistence ☆1,858 · Feb 25, 2026 · Updated last week
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what… ☆96 · Nov 14, 2019 · Updated 6 years ago
- A machine learning package built for humans. ☆4,800 · Nov 6, 2025 · Updated 3 months ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,037 · Nov 21, 2022 · Updated 3 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Sep 8, 2022 · Updated 3 years ago
- ☆17 · Oct 27, 2015 · Updated 10 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark. ☆29 · Jan 6, 2017 · Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 · Feb 21, 2014 · Updated 12 years ago
- A collection of Apache Parquet add-on modules ☆30 · Updated this week