FINRAOS / herd-uiLinks
Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and understand the contents of your Herd managed data lake.
☆16Updated 3 years ago
Alternatives and similar repositories for herd-ui
Users that are interested in herd-ui are comparing it to the libraries listed below
Sorting:
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 3 months ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- spark-emr☆15Updated 11 years ago
- ☆14Updated 2 years ago
- pysh-db - The Data Science Toolkit (DSK)☆13Updated 6 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 3 years ago
- Events about the open source data stack☆13Updated 3 years ago
- Bender - Serverless ETL Framework☆188Updated last year
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Updated 2 years ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- Automatically loads new partitions in AWS Athena☆19Updated 5 years ago
- Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.☆16Updated last year
- A component which takes nifi flow xml file as input and converts it into terraform script for creating/updating a flow on nifi☆28Updated 3 years ago
- Functional Airflow DAG definitions.☆38Updated 8 years ago
- Presentations and other resources.☆36Updated 5 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, …☆84Updated 2 years ago
- A Cloud Native Query Engine. Serverless, if it fits your case.☆54Updated 2 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- ☆54Updated last week
- Marquez Web UI☆21Updated 4 years ago
- Convert a CSV fle to ORCFile☆26Updated 6 years ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated last year
- tap-postgres☆68Updated last year
- A collection of datasets and databases☆24Updated 7 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆90Updated last year
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 10 years ago
- Liquibase support for Redshift☆17Updated last week