twitter / caladrius
Performance modelling system for Distributed Stream Processing Systems (DSPS) such as Apache Heron and Apache Storm
☆22Updated last year
Related projects: ⓘ
- Apache datasketches☆85Updated last year
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated 2 months ago
- DS2 is an auto-scaling controller for distributed streaming dataflows☆88Updated last year
- Mirror of Apache Arrow site☆33Updated this week
- Mirror of Apache livy (Incubating)☆13Updated 4 months ago
- Self regulation and auto-tuning for distributed system☆64Updated last year
- A composable framework for fast and scalable data analytics☆57Updated last year
- Albis: High-Performance File Format for Big Data Systems☆21Updated 6 years ago
- ☆14Updated 2 years ago
- struct2tensor is a library for parsing and manipulating structured data inside of tensorflow.☆32Updated last week
- Milan is a Scala API and runtime infrastructure for building data-oriented systems, built on top of Apache Flink.☆39Updated last year
- ESPBench - The Enterprise Stream Processing Benchmark☆13Updated 8 months ago
- Fast I/O plugins for Spark☆41Updated 3 years ago
- Website for DataSketches.☆94Updated this week
- Repository for building CDAP and additional external projects☆15Updated last week
- Shaded version of Apache Hive for Trino☆8Updated last month
- Dione - a Spark and HDFS indexing library☆49Updated 6 months ago
- The Musketeer workflow manager.☆41Updated 5 years ago
- Apache Beam Site☆29Updated last week
- Java Sketch Characterization Code.☆10Updated 2 weeks ago
- Parquet file generator☆22Updated 6 years ago
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆75Updated last year
- Cloud Spanner Connector for Apache Spark☆17Updated last month
- Machine Learning Inference Graph Spec☆21Updated 5 years ago
- ☆27Updated 3 weeks ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆11Updated last month
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated last year
- Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms.☆27Updated last year
- A demo of Redis Enterprise as the Online Feature Store deployed on GCP with Feast and NVIDIA Triton Inference Server.☆15Updated last year
- Fybrik platform - Arrow/Flight module☆16Updated last month