FINRAOS / herd-ui
Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and understand the contents of your Herd managed data lake.
☆16 · Updated 3 years ago
Alternatives and similar repositories for herd-ui
Users who are interested in herd-ui are comparing it to the repositories listed below.
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o… ☆52 · Updated 6 months ago
- pysh-db - The Data Science Toolkit (DSK) ☆13 · Updated 7 years ago
- ☆14 · Updated 3 years ago
- spark-emr ☆15 · Updated 11 years ago
- Chatlytics is a data query and visualization platform for chat! ☆13 · Updated 8 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆91 · Updated last year
- Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information. ☆15 · Updated last year
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt… ☆138 · Updated 3 years ago
- The sane way of building a data layer in Airflow ☆24 · Updated 6 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆68 · Updated 2 years ago
- Events about the open source data stack ☆13 · Updated 3 years ago
- Data Catalog for Databases and Data Warehouses ☆35 · Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆42 · Updated 11 months ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS) ☆47 · Updated 2 years ago
- Automatically loads new partitions in AWS Athena ☆19 · Updated 5 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, … ☆84 · Updated 2 years ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆25 · Updated last year
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated 2 years ago
- An Apache Hive SerDe (short for serializer/deserializer) for the Ion file format. ☆31 · Updated 9 months ago
- Apache Spark AWS Lambda Executor (SAMBA) ☆44 · Updated 7 years ago
- Liquibase support for Redshift ☆17 · Updated last week
- ☆58 · Updated last month
- A CLI to manage and monitor permissions in AWS Lake Formation ☆25 · Updated 2 years ago
- Data validation library for PySpark 3.0.0 ☆33 · Updated 3 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs … ☆159 · Updated 3 years ago
- Presentations and other resources. ☆36 · Updated 5 years ago
- Code that was used as an example during the Data+AI Summit 2020 ☆15 · Updated 4 years ago
- Utility functions for dbt projects running on Spark ☆34 · Updated 2 weeks ago
- Bender - Serverless ETL Framework ☆188 · Updated 2 years ago
- Dremio Flight connector. Access Dremio using Arrow flight ☆39 · Updated 5 years ago