cdapio / cdap-buildLinks
Repository for building CDAP and additional external projects
☆16Updated 2 weeks ago
Alternatives and similar repositories for cdap-build
Users that are interested in cdap-build are comparing it to the libraries listed below
Sorting:
- Cask Hydrator Plugins Repository☆68Updated last month
- CDAP UI☆20Updated this week
- Wrangler Transform: A DMD system for transforming Big Data☆106Updated 2 months ago
- CDAP Kubernetes Operator☆19Updated 3 months ago
- Database plugins☆13Updated last week
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 2 years ago
- Pipeline library for StreamSets Data Collector and Transformer☆33Updated 2 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆53Updated last week
- A command line interface for your tiny smart workers.☆16Updated 4 months ago
- A collection of Google Cloud Platform (GCP) plugins☆49Updated this week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated this week
- Utilities to showcase OpenMetadata☆31Updated this week
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 10 years ago
- Kubeflow example of machine learning/model serving☆37Updated 5 years ago
- Foodmart data set in hsqldb format☆27Updated 2 weeks ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 3 weeks ago
- Playground site for creating/validating data contracts☆10Updated 2 months ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Updated 5 years ago
- Jupyter Integration for Flink SQL via Ververica Platform☆43Updated 2 years ago
- Get started with Apache Beam and Flink☆43Updated 9 years ago
- spark-drools tutorials☆16Updated last year
- Cloud-based SQL engine using SPARK where data is accessible as JDBC/ODBC data source via Spark ThriftServer.☆31Updated 8 years ago
- Explore Apache Kafka data pipelines in Kubernetes.☆46Updated 4 months ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 4 years ago
- DataQuality for BigData☆144Updated last year
- The Internals of Spark on Kubernetes☆72Updated 3 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 2 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆122Updated this week
- PostgreSQL wire-protocol proxy for Cloud Spanner☆74Updated this week