startreedata / thirdeye
ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.
☆99Updated this week
Alternatives and similar repositories for thirdeye:
Users who are interested in thirdeye are comparing it to the repositories listed below.
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆92Updated 2 years ago
- ☆67Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆44Updated last year
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆23Updated 4 months ago
- Open Control Plane for Tables in Data Lakehouse☆323Updated this week
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆48Updated 2 years ago
- Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases☆228Updated 2 years ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy☆75Updated this week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated this week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆154Updated 2 months ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 6 months ago
- Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated 6 months ago
- Multi-hop declarative data pipelines☆109Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆189Updated this week
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data.☆250Updated last year
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆226Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆318Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆130Updated last month
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆226Updated last month
- DB API 2 interface for Flight SQL with SQLAlchemy extras.☆37Updated 4 months ago
- Delta Lake helper methods. No Spark dependency.☆22Updated 5 months ago
- The metrics layer for your data. Join us at https://metriql.com/slack☆304Updated last year
- ☆80Updated this week
- Sample configuration to deploy a modern data platform.☆87Updated 3 years ago
- ☆79Updated last year
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 2 months ago
- ☆22Updated 2 months ago
- A Table format agnostic data sharing framework☆38Updated last year