ExpediaGroup / insights-explorer
Insights Explorer is a tool to catalogue and present analytical & research work.
☆13 Updated 3 months ago
Alternatives and similar repositories for insights-explorer:
Users who are interested in insights-explorer are comparing it to the libraries listed below.
- Apiary provides modules which can be combined to create a federated cloud data lake ☆36 Updated 11 months ago
- Extensions available for use in Apiary ☆10 Updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring ☆25 Updated 7 months ago
- A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service ☆10 Updated 4 months ago
- A component which takes nifi flow xml file as input and converts it into terraform script for creating/updating a flow on nifi ☆28 Updated 3 years ago
- Amundsen Gremlin ☆21 Updated 2 years ago
- Multi-hop declarative data pipelines ☆111 Updated this week
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 Updated 2 years ago
- Terraform scripts for deploying Apiary Data Lake ☆19 Updated 2 weeks ago
- Deploy Presto on the cloud easily, using Terraform and Packer ☆44 Updated 2 years ago
- ☆19 Updated 5 months ago
- The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and eg… ☆31 Updated 2 weeks ago
- Mutation testing framework and code coverage for Hive SQL ☆24 Updated 3 years ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage. ☆27 Updated this week
- Graph Analytics with Apache Kafka ☆104 Updated this week
- Demonstration of a Hive Input Format for Iceberg ☆26 Updated 4 years ago
- Service for automatically managing and cleaning up unreferenced data ☆46 Updated this week
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o… ☆50 Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 Updated 2 months ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro ☆18 Updated last year
- Stream Discovery and Stream Orchestration ☆122 Updated last month
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores. ☆20 Updated 3 years ago
- Explore Apache Kafka data pipelines in Kubernetes. ☆45 Updated 3 weeks ago
- A Kafka Serde that reads and writes records from and to Blob storage (S3, Azure, Google) transparently. ☆59 Updated this week
- dbt's adapter for dremio ☆48 Updated 2 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi ☆62 Updated last year
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple… ☆26 Updated 3 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al. ☆20 Updated 4 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆135 Updated last year