FINRAOS / herd-mdl
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.
☆16Updated 9 months ago
Alternatives and similar repositories for herd-mdl
Users that are interested in herd-mdl are comparing it to the libraries listed below
Sorting:
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 2 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Apache Pig plugin for Eclipse☆12Updated 8 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- Stocks -> NiFi -> Kafka -> Profit☆14Updated 6 years ago
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago
- An AWS Lambda package including two functions to dynamically maintain a security partition around a group of AWS resources which originat…☆12Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last year
- Hive Storage Handler for Kinesis.☆11Updated 9 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…☆70Updated 2 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- ☆7Updated 9 years ago
- Very basic web app project that grabs a twitter stream and runs it through Stanfords Core NLP☆10Updated 9 years ago
- Kafka Connect playground☆10Updated 5 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- A solution describing data-processing design pattern for streaming data through Kinesis and Spark Streaming at real-time.☆38Updated 11 months ago
- A component which takes nifi flow xml file as input and converts it into terraform script for creating/updating a flow on nifi☆28Updated 3 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- Detect memory leaks in minutes without a heap dump.☆17Updated 8 years ago
- Automated schema design for NoSQL applications☆29Updated 9 months ago
- Sandbox for Apache nifi☆24Updated 3 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- A secure proxy service for managing OneOps secrets.☆13Updated last year
- Create stacks (aka stax) on AWS (Amazon Web Services) in a private VPC (Virtual Private Cloud) with failover NAT nodes proxying network t…☆17Updated 6 years ago
- A java library for stored queries☆16Updated last year
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago
- Distributed Dexecutor Using Ignite☆10Updated 7 years ago
- An Object Graph Mapping Library For Gremlin☆32Updated 7 years ago