cdapio / cdap-build
Repository for building CDAP and additional external projects
☆16 Updated this week
Alternatives and similar repositories for cdap-build
Users who are interested in cdap-build are comparing it to the repositories listed below.
- Cask Hydrator Plugins Repository ☆69 Updated last week
- CDAP UI ☆20 Updated last month
- Wrangler Transform: A DMD system for transforming Big Data ☆106 Updated last month
- Database plugins ☆13 Updated this week
- Pipeline library for StreamSets Data Collector and Transformer ☆33 Updated 2 years ago
- A collection of Google Cloud Platform (GCP) plugins ☆49 Updated last week
- Playground site for creating/validating data contracts ☆10 Updated 2 months ago
- A command line interface for your tiny smart workers. ☆15 Updated 3 months ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 Updated 10 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes ☆53 Updated 5 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi ☆15 Updated 2 years ago
- A collection of Restate examples for AI use cases: agents, A2A, MCP, ... ☆48 Updated last week
- Utilities to showcase OpenMetadata ☆30 Updated 3 months ago
- Dione - a Spark and HDFS indexing library ☆52 Updated last year
- Data abstraction, storage, discovery, and serving system ☆33 Updated last week
- Demonstration of a Hive Input Format for Iceberg ☆26 Updated 4 years ago
- Scalable CDC Pattern Implemented using PySpark ☆18 Updated this week
- Get started with Apache Beam and Flink ☆43 Updated 8 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆76 Updated last week
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect ☆13 Updated last year
- A Github API client to extract events and actions, and load into a database ☆28 Updated 3 years ago
- Lenses.io JDBC driver for Apache Kafka ☆21 Updated 4 years ago
- CDAP Kubernetes Operator ☆19 Updated 2 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆122 Updated last week
- Core & Community developed monitoring integrations for Sematext monitoring agent ☆13 Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆30 Updated 2 weeks ago
- A repository to store recipes, custom sources, transformations and other things to make your DataHub experience magical ☆12 Updated 3 years ago
- Explore Apache Kafka data pipelines in Kubernetes. ☆46 Updated 3 months ago
- Jupyter Integration for Flink SQL via Ververica Platform ☆43 Updated 2 years ago
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink ☆53 Updated 9 months ago