finos / datahelix
The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
☆142Updated last year
Alternatives and similar repositories for datahelix:
Users that are interested in datahelix are comparing it to the libraries listed below
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆92Updated 2 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Drools processor for Apache NiFi☆38Updated 5 years ago
- Graph Analytics with Apache Kafka☆104Updated last week
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆121Updated 8 months ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆49Updated last year
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆41Updated 3 months ago
- Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more.☆102Updated this week
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- The Data Product Descriptor Specification (DPDS) Repository☆76Updated 2 weeks ago
- Data Mesh Architecture☆74Updated 6 months ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆103Updated 2 years ago
- A custom ContentRepository implementation for NiFi to persist data to MinIO Object Storage☆34Updated 2 years ago
- Teiid is a data virtualization system that allows applications to use data from multiple, heterogenous data stores.☆303Updated 2 years ago
- Apache DataLab (incubating)☆153Updated last year
- Data Catalog for Databases and Data Warehouses☆32Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Mirror of Apache NiFi Flow Design System☆44Updated last year
- Marquez Web UI☆22Updated 4 years ago
- The metrics layer for your data. Join us at https://metriql.com/slack☆303Updated last year
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Tool to automate data quality checks on data pipelines☆253Updated 2 years ago
- ☆18Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆123Updated 3 years ago
- Client swagger for nifi with security☆38Updated 2 years ago
- Functional testing framework for Big Data pipelines.☆57Updated last year