【2026最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。
☆1,075Oct 8, 2025Updated 5 months ago
Alternatives and similar repositories for data-warehouse-learning
Users that are interested in data-warehouse-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 该项目整合了多款优秀的开源产品,构建了一个功能全面的数据开发平台。平台提供了强大的数据集成、数据开发、数据查询、数据服务、数据质量管理、工作流调度和元数据管理功能。#dinky #dolphinscheduler #datavines #flinkcdc #openmeta…☆628Aug 5, 2025Updated 7 months ago
- 大数据组件学习代码☆65May 6, 2024Updated last year
- Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.☆723Mar 6, 2026Updated 2 weeks ago
- Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.☆3,711Updated this week
- 🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。产品正式演示体验、社群咨询、商务采购:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo☆2,990Feb 26, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-n…☆1,311Jul 22, 2025Updated 8 months ago
- 大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。☆1,735Feb 12, 2026Updated last month
- 数据建设与大数据技术知识体系,包含hadoop、hive、spark、flink主流框架和系列框架,数据中台、数据湖、数据治理、数仓建设、数据化转型等☆444Aug 8, 2025Updated 7 months ago
- Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI☆1,292Updated this week
- Make stream processing easier! Easy-to-use streaming application development framework and operation platform.☆4,303Mar 13, 2026Updated last week
- SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.☆9,190Updated this week
- 该仓库专注于让读者秒懂Flink组件,包含Flink实战代码和文档、200个Flink教程知识点,Flink Datastream、Flink Table、Flink Window、Flink State、Flink Checkpoint、Flink Metrics、Fli…☆763Jun 14, 2024Updated last year
- ☆467Sep 17, 2022Updated 3 years ago
- SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offlin…☆807Jan 22, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- LarkMidTable 是一站式开源的数据中台,实现中台的 基础建设,数据治理,数据开发,监控告警,数据服务,数据的可视化,实现高效赋能数据前台并提供数据服务的产品。☆2,033Aug 20, 2023Updated 2 years ago
- CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source bi…☆491Oct 31, 2025Updated 4 months ago
- Flink CDC is a streaming data integration tool☆6,375Updated this week
- 这个平台旨在提供一个高效、便捷的数据处理和分析环境,适用于数据科学家、数据工程师以及任何对数据处理有需求的用户。☆55Aug 5, 2025Updated 7 months ago
- Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernete…☆397Dec 17, 2025Updated 3 months ago
- 专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...☆10,432Aug 7, 2023Updated 2 years ago
- A data integration framework☆4,106Dec 2, 2025Updated 3 months ago
- 通用数据生成平台☆13Mar 11, 2025Updated last year
- Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …☆3,219Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Apache Fluss is a streaming storage built for real-time analytics.☆1,826Updated this week
- flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Ta…☆15,058Mar 9, 2026Updated 2 weeks ago
- Doris表和字段血缘项目☆89Apr 30, 2024Updated last year
- Ultra-Lightweight AI-Powered Big Data Center | 至轻云-超轻量级智能化大数据中心/数据中台☆248Updated this week
- Apache Doris is an easy-to-use, high performance and unified analytics database.☆15,139Updated this week
- a dbt adapter for Apache Doris☆27Nov 17, 2023Updated 2 years ago
- 解析 SQL 字段数据血缘☆97Apr 17, 2025Updated 11 months ago
- 一个实时数仓项目,从0到1搭建实时数仓☆63May 27, 2021Updated 4 years ago
- SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradi…☆4,758Mar 9, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于Flink+Kafka的全链路数仓, 包括实时和离线☆42Jan 28, 2023Updated 3 years ago
- Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code☆14,193Updated this week
- 从数据仓库到用户画像,从数据建设到数据应用☆626Jan 26, 2022Updated 4 years ago
- 大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料☆3,148Jan 20, 2026Updated 2 months ago
- Apache Doris MCP Server☆268Mar 13, 2026Updated last week
- 基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。☆4,474Feb 4, 2024Updated 2 years ago
- 基于flink的实时流计算web平台☆1,866Dec 2, 2025Updated 3 months ago