cdapio / cdap-buildLinks
Repository for building CDAP and additional external projects
☆16Updated last week
Alternatives and similar repositories for cdap-build
Users that are interested in cdap-build are comparing it to the libraries listed below
Sorting:
- CDAP UI☆20Updated last week
- Cask Hydrator Plugins Repository☆68Updated last month
- Wrangler Transform: A DMD system for transforming Big Data☆106Updated 2 months ago
- CDAP Kubernetes Operator☆19Updated 2 months ago
- Playground site for creating/validating data contracts☆11Updated 6 months ago
- Database plugins☆13Updated last week
- A collection of Google Cloud Platform (GCP) plugins☆49Updated 3 weeks ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆95Updated last year
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 10 years ago
- Fivetran data models for QuickBooks using dbt.☆33Updated last week
- The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and eg…☆32Updated 8 months ago
- ☆67Updated last year
- Open source tools for Google Cloud Storage and Databases.☆63Updated last year
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆52Updated 2 weeks ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Marquez Web UI☆21Updated 5 years ago
- Verify Hive SQL without running the sql exactly. Just check the syntax before run.☆24Updated 13 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆80Updated this week
- Hadoop utility jar for troubleshooting integration with cloud object stores☆37Updated 2 weeks ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- The gateway component to make Spark on K8s much easier for Spark users.☆210Updated last month
- A tool for developing and testing ETL and ELT processes for automating the capture, delivery and processing of information in data wareho…☆59Updated 2 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 5 months ago
- Apache iceberg Spark s3 examples☆21Updated last year
- Data abstraction, storage, discovery, and serving system☆35Updated last week
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆122Updated this week
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali…☆13Updated 4 years ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated last year
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 4 years ago
- A Github API client to extract events and actions, and load into a database☆28Updated 4 years ago