cdapio/cdap

An open source framework for building data analytic applications.

☆789

Alternatives and similar repositories for cdap

Users that are interested in cdap are comparing it to the libraries listed below.

cdapio / hydrator-plugins
Cask Hydrator Plugins Repository
☆69 · Jul 13, 2026 · Updated last week
cdapio / cdap-ui
CDAP UI
☆21 · Jul 13, 2026 · Updated last week
cdapio / cdap-apps
CDAP Applications
☆45 · Jan 29, 2018 · Updated 8 years ago
cdapio / cdap-operator
CDAP Kubernetes Operator
☆19 · Jul 14, 2026 · Updated last week
data-integrations / wrangler
Wrangler Transform: A DMD system for transforming Big Data
☆108 · Jul 15, 2026 · Updated last week
Teradata / kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…
☆1,111 · Jan 12, 2023 · Updated 3 years ago
data-integrations / google-cloud
A collection of Google Cloud Platform (GCP) plugins
☆49 · Jul 2, 2026 · Updated 2 weeks ago
cdapio / tephra
Apache Tephra: Transactions for HBase.
☆159 · Sep 13, 2024 · Updated last year
cdapio / coopr
A template-based cluster provisioning system
☆62 · Mar 4, 2023 · Updated 3 years ago
apache / gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270 · Jun 24, 2026 · Updated 3 weeks ago
cdapio / cdap-build
Repository for building CDAP and additional external projects
☆16 · Updated this week
data-integrations / database-plugins
Database plugins
☆14 · Jun 23, 2026 · Updated 3 weeks ago
cdap-solutions / dre
Yet-Another-Rules-Engine -- A easy-to-understand Business Readable DSL for defining production rules.
☆14 · Mar 24, 2021 · Updated 5 years ago
Netflix / metacat
☆1,687 · Updated this week
apache / beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
☆8,635 · Updated this week
MarquezProject / marquez
Collect, aggregate, and visualize a data ecosystem's metadata
☆2,244 · Updated this week
datahub-project / datahub
The Context Platform for your Data and AI Stack
☆12,315 · Updated this week
delta-io / delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆8,924 · Updated this week
apache / pinot
Apache Pinot - A realtime distributed OLAP datastore
☆6,117 · Updated this week
spark-jobserver / spark-jobserver
REST job server for Apache Spark
☆2,837 · Mar 3, 2026 · Updated 4 months ago
YotpoLtd / metorikku
A simplified, lightweight ETL Framework based on Apache Spark
☆588 · Jan 24, 2024 · Updated 2 years ago
dremio / dremio-oss
Dremio - the missing link in modern data
☆1,488 · Sep 26, 2025 · Updated 9 months ago
apache / twill
Mirror of Apache Twill
☆69 · Mar 16, 2020 · Updated 6 years ago
apache / nifi
Apache NiFi
☆6,168 · Updated this week
harbby / sylph
Stream computing platform for bigdata
☆406 · Apr 24, 2024 · Updated 2 years ago
Alluxio / alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,214 · Apr 29, 2025 · Updated last year
pabloem / awesome-beam
A curated list of awesome resources for Apache Beam
☆144 · Nov 11, 2022 · Updated 3 years ago
apache / griffin
Mirror of Apache griffin
☆1,169 · Aug 3, 2025 · Updated 11 months ago
hbutani / spark-druid-olap
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…
☆281 · Aug 3, 2018 · Updated 7 years ago
uber-archive / AthenaX
SQL-based streaming analytics platform at scale
☆1,224 · Jun 21, 2020 · Updated 6 years ago
OpenLineage / OpenLineage
An Open Standard for lineage metadata collection
☆2,555 · Updated this week
linkedin / dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370 · Aug 22, 2023 · Updated 2 years ago
apache / drill
Apache Drill is a distributed MPP query layer for self describing data
☆2,022 · Jul 15, 2026 · Updated last week
apache / kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353 · Updated this week
AbsaOSS / spline
Data Lineage Tracking And Visualization Solution
☆662 · Updated this week
cdapio / cdap-hbase-increments
Efficient read-less increments for HBase
☆10 · May 28, 2015 · Updated 11 years ago
hortonworks / streamline
StreamLine - Streaming Analytics
☆167 · Aug 27, 2023 · Updated 2 years ago
Netflix / genie
Distributed Big Data Orchestration Service
☆1,763 · Jul 13, 2026 · Updated last week
apache / carbondata
High performance data store solution
☆1,448 · Jul 4, 2026 · Updated 2 weeks ago