twitter / caladriusLinks
Performance modelling system for Distributed Stream Processing Systems (DSPS) such as Apache Heron and Apache Storm
☆26Updated 2 years ago
Alternatives and similar repositories for caladrius
Users that are interested in caladrius are comparing it to the libraries listed below
Sorting:
- Controllers, wrappers and miscaleus utils to make it easier for Argo to be used in ML scenarios☆24Updated 4 years ago
- Tools for creating repos based on open source standards and best practices☆37Updated 4 years ago
- Website for DataSketches.☆104Updated last week
- Cask Hydrator Plugins Repository☆69Updated 2 weeks ago
- struct2tensor is a library for parsing and manipulating structured data inside of tensorflow.☆34Updated 2 weeks ago
- General Metadata Architecture☆131Updated this week
- Self regulation and auto-tuning for distributed system☆66Updated 2 years ago
- Dione - a Spark and HDFS indexing library☆52Updated last year
- Cache File System optimized for columnar formats and object stores☆185Updated 3 years ago
- Sherlock is an anomaly detection service built on top of Druid☆155Updated 9 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆49Updated last year
- A load generator, built for engineers☆28Updated 2 years ago
- Apache datasketches☆99Updated 2 years ago
- Shaded version of Apache Hive for Trino☆10Updated last year
- Library for per-file client-side encyption in Hadoop FileSystems such as HDFS or S3.☆48Updated this week
- Testbench for experimenting with Apache Hive at any data scale.☆64Updated 8 years ago
- Repository for Twitter Open Source Decks☆12Updated 6 years ago
- ☆28Updated 3 months ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated last year
- Documentation and implementation of telemetry ingestion on Google Cloud Platform☆84Updated this week
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆133Updated last year
- ☆11Updated 5 years ago
- LinkedIn's version of Apache Calcite☆23Updated 2 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-connection☆29Updated last year
- Big Data Processing Framework - Unified Data API or SQL on Any Storage☆246Updated 2 months ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆427Updated 3 years ago
- Milan is a Scala API and runtime infrastructure for building data-oriented systems, built on top of Apache Flink.☆40Updated 2 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated 2 years ago
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆254Updated 2 months ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated last year