A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype
☆198Aug 12, 2020Updated 5 years ago
Alternatives and similar repositories for bdp
Users that are interested in bdp are comparing it to the libraries listed below
Sorting:
- A serverless datalake project and framework based on AWS S3,Glue,Athena,MWAA and QuickSight. With a series of best practices, it guides y…☆16Nov 22, 2022Updated 3 years ago
- Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured h…☆458Oct 28, 2025Updated 4 months ago
- Moonbox is a DVtaaS (Data Virtualization as a Service) Platform☆506Apr 14, 2023Updated 2 years ago
- 🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。产品正式演示体验、社群咨询、商务采购:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo☆2,980Feb 26, 2026Updated last week
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆977Nov 16, 2022Updated 3 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆211Dec 5, 2022Updated 3 years ago
- A chatbot which is designed for open source community, able to answer open source related questions and guide you to do OSS.☆13Apr 2, 2023Updated 2 years ago
- 数据中后台高阶组件☆12Feb 10, 2026Updated 3 weeks ago
- ☆306Oct 5, 2022Updated 3 years ago
- 整合报表工具☆11Oct 20, 2017Updated 8 years ago
- 项目中保留了向开源社区提交过的patch☆16Oct 22, 2017Updated 8 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- customer visualization for splunk using echarts☆15May 11, 2017Updated 8 years ago
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- DBus☆1,214Dec 6, 2022Updated 3 years ago
- Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…☆814Dec 11, 2024Updated last year
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 2 years ago
- ☆62Oct 22, 2025Updated 4 months ago
- ☆15Jun 7, 2020Updated 5 years ago
- EasyScheduler中文在线文档, 本文档属于1.2.0以前的历史文档, 最新文档在:https://dolphinscheduler.apache.org☆15Jul 6, 2020Updated 5 years ago
- Custom datasource about spark structure streaming☆12Jan 29, 2019Updated 7 years ago
- 使用freemarker 编辑word模板,填充数据后,导出。☆13Jan 27, 2015Updated 11 years ago
- Verify Hive SQL without running the sql exactly. Just check the syntax before run.☆24Oct 19, 2012Updated 13 years ago
- ☆19Jun 16, 2021Updated 4 years ago
- 瑞金医院MMC人工智能辅助构建知识图谱大赛TOP40解决方案☆19Jan 17, 2019Updated 7 years ago
- 大数据【企业级360°全方位用户画像】标签开发部分源码☆19Dec 18, 2020Updated 5 years ago
- 数据治理、数据标准相关的 web 工具☆39Apr 22, 2022Updated 3 years ago
- A data integration framework☆4,110Dec 2, 2025Updated 3 months ago
- 该系统为淘宝店主做海淘生意的进货管理、商品库存而设计☆20Mar 19, 2022Updated 3 years ago
- 蓝泰源大数据基础平台☆17Mar 7, 2018Updated 7 years ago
- 基于 antlr4 的多种数据库SQL解析器,获取SQL中元数据,可用于数据平台产品中的多个场景:ddl语 句提取元数据、sql 权限校验、表级血缘、sql语法校验等场景。支持spark、flink、gauss、starrocks、Oracle、MYSQL、Postgresq…☆403Feb 24, 2026Updated last week
- Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernete…☆399Dec 17, 2025Updated 2 months ago
- 大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。☆1,728Feb 12, 2026Updated 3 weeks ago
- hera 分布式任务调度系统 大数据任务调度系统 任务调度 (数据部门专用)☆373Aug 14, 2023Updated 2 years ago
- trafficLab即竞价大数据监控平台,采用前后端分离的开发框架,包括trafficLab-server后端服务工程,trafficLab-vue前端展示工程,trafficLab-js采集组件。实现的一套竞价词监控分析工具,包括加粉配置,粉丝来源分析,粉丝词汇分析,热词…☆18Dec 16, 2023Updated 2 years ago
- Dig Spark's source code.☆17Feb 1, 2024Updated 2 years ago
- springboot整合mina框架☆17Jun 17, 2022Updated 3 years ago
- DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitizati…☆3,253Nov 4, 2025Updated 4 months ago
- lite-tracer轻量级链路追踪系统,google-dapper个人实现。仅供学习研究☆38Sep 7, 2018Updated 7 years ago