ExpediaGroup / insights-explorer
Insights Explorer is a tool to catalogue and present analytical & research work.
☆13Updated 5 months ago
Alternatives and similar repositories for insights-explorer
Users that are interested in insights-explorer are comparing it to the libraries listed below
Sorting:
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- Extensions available for use in Apiary☆11Updated this week
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service☆12Updated 6 months ago
- Mutation testing framework and code coverage for Hive SQL☆24Updated 4 years ago
- Terraform scripts for deploying Apiary Data Lake☆19Updated this week
- Amundsen Gremlin☆21Updated 2 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Multi-hop declarative data pipelines☆115Updated this week
- A component which takes nifi flow xml file as input and converts it into terraform script for creating/updating a flow on nifi☆28Updated 3 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last year
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- My speaker profile for events and conferences based on codepo8/presenter-terms☆13Updated last month
- Dione - a Spark and HDFS indexing library☆52Updated last year
- Service for automatically managing and cleaning up unreferenced data☆46Updated last week
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs☆26Updated 4 months ago
- Marquez Web UI☆22Updated 4 years ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆41Updated 4 months ago
- Hadoop Yarn aggregated log parser utility☆23Updated 5 years ago
- The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and eg…☆32Updated last week
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆67Updated 2 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆27Updated 8 months ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆28Updated last week
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆41Updated 3 weeks ago
- Projects developed by Domino's R&D team☆76Updated 3 years ago
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆92Updated 2 years ago