startreedata / thirdeye
ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.
☆99Updated last month
Alternatives and similar repositories for thirdeye:
Users that are interested in thirdeye are comparing it to the libraries listed below
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆92Updated 2 years ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆76Updated 3 weeks ago
- Multi-hop declarative data pipelines☆112Updated this week
- A Table format agnostic data sharing framework☆38Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆134Updated 2 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated 2 weeks ago
- Sherlock is an anomaly detection service built on top of Druid☆155Updated 3 months ago
- ☆69Updated 3 weeks ago
- Open Control Plane for Tables in Data Lakehouse☆331Updated last week
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated 8 months ago
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆27Updated this week
- A library that provides useful extensions to Apache Spark and PySpark.☆220Updated this week
- Unity Catalog UI☆40Updated 6 months ago
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆55Updated 2 years ago
- Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases☆229Updated 3 years ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆157Updated 3 months ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated last week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated 2 years ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆237Updated last week
- Data Processing/Feature Calculation Engine for Real-Time AI/ML☆40Updated this week
- ☆40Updated last year
- ☆52Updated 7 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- Data Tools Subjective List☆83Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- ☆79Updated last year
- A simple Spark-powered ETL framework that just works 🍺☆181Updated 3 weeks ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆39Updated 6 months ago
- Storage connector for Trino☆106Updated last week