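JD's RTF real-time data lake is a system rebuilt from the ground up. It handles ETL steps such as data ingestion, parsing, and cleansing. It delivers the freshness that traditional offline (batch) pipelines cannot reach, along with the data cleansing and restoration that streaming real-time data cannot provide, making it a transformative real-time data solution for the big data field. RTF lets users query the latest state of the data directly, with no deduplication required, so data analysts can obtain real-time data for analysis even if they are unfamiliar with real-time computing frameworks such as Flink or Spark.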
☆129Sep 29, 2023Updated 2 years ago
Alternatives and similar repositories for rtf-lake
Users that are interested in rtf-lake are comparing it to the libraries listed below.
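- flink-sql: a platform for running SQL and building data flows on Flink, based on Apache Flink 1.10.0☆113Jun 21, 2022Updated 3 years ago
- Scheduling of offline jobs (Spark, Flink, etc.) and monitoring of real-time jobs☆305Nov 13, 2025Updated 5 months ago
- Flink example development: data cleansing and data reports☆58Sep 13, 2025Updated 7 months ago
- FlinkTutorial focuses on Flink stream processing for big data, covering the basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Written in Java, with some core code in Scala. Feel free to follow my blog and GitHub.☆70Jun 21, 2022Updated 3 years ago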
- Using Flink SQL to build ETL job☆206Sep 29, 2023Updated 2 years ago
- DW ETL tool: incremental and full extraction from MySQL to Hive, merging of Hive tables, and other data-platform cleansing utilities☆10Dec 21, 2016Updated 9 years ago
- Visual music player built on web-audio-api☆14May 13, 2020Updated 5 years ago
- ☆12Mar 10, 2019Updated 7 years ago
- Dynamic, real-time, cross-platform user profiling system for hundreds of millions of users, based on Flink stream processing☆487Dec 14, 2022Updated 3 years ago
- Platform for Flink☆282Jan 3, 2023Updated 3 years ago
- Real-time MySQL/Oracle data synchronization based on Canal/Kafka Connect, plus Flink REST API, Flink SQL, and UDFs☆50Sep 8, 2022Updated 3 years ago
- ☆16Nov 16, 2022Updated 3 years ago
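- A big data computing platform driven by Flink SQL. In particular, it provides SQL programming.☆21Jan 5, 2023Updated 3 years ago
- Big data collection and extraction platform. zdh_web is the visual management platform for the zdh family of services, with modules for data collection, scheduling, permissions, approval workflows, private-domain marketing, and more☆529Apr 27, 2026Updated last week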
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆80Mar 21, 2024Updated 2 years ago
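- This project serves mainly as the data bus for a data middle platform or data platform, and supports direct real-time listening for data changes in databases such as MySQL, MongoDB, PostgreSQL, Oracle, SQL Server, Db2, and Cassandra.☆64Dec 5, 2023Updated 2 years ago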
- Detects shot boundaries from news with K-Means, using the Bhattacharyya coefficient for distance.☆10Jun 1, 2017Updated 8 years ago
- Data cleansing template program (Spring Batch)☆10Jul 18, 2016Updated 9 years ago
- Implement a complete data warehouse ETL using Spark SQL☆14Sep 8, 2022Updated 3 years ago
- Big data smart alarm by sql☆12May 11, 2021Updated 4 years ago
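- A web system built for Flink. Supports defining UDFs in the page and submitting SQL and JAR jobs; supports managing sources, sinks, and jobs; can manage Flink clusters on OpenShift☆290Dec 16, 2022Updated 3 years ago
- Logging demo based on Spring MVC + Redis + Logback + ELK☆12Feb 23, 2017Updated 9 years ago
- Submit Flink/Spark jobs from a local IDEA to YARN/k8s clusters☆167Oct 18, 2021Updated 4 years ago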
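- Visual music player built with Vue + Web Audio API + Canvas☆18Mar 4, 2023Updated 3 years ago
- Knowledge base on data infrastructure and big data technology, covering mainstream frameworks such as Hadoop, Hive, Spark, and Flink and their related frameworks, plus data middle platforms, data lakes, data governance, data warehouse construction, digital transformation, and more☆451Aug 8, 2025Updated 9 months ago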
- This repo is deprecated. Please refer to https://github.com/glink-incubator/glink☆17Sep 29, 2023Updated 2 years ago
- Bireme is an incremental synchronization tool for the Greenplum / HashData data warehouse☆139Feb 23, 2022Updated 4 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 6 years ago
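- DataLink is a distributed, scalable data exchange platform that supports real-time incremental synchronization and offline full synchronization between heterogeneous data sources.☆1,120Dec 6, 2022Updated 3 years ago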
- Apache Spark - A unified analytics engine for large-scale data processing☆16Jul 24, 2023Updated 2 years ago
- 1. Spark offline batch processing and real-time user click statistics; 2. SparkSQL log content analysis; 3. Audience movie analysis => (Kafka + SparkStreaming + Redis) and (Kafka + SparkStreaming + MySQL)☆29Jun 21, 2022Updated 3 years ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆115Apr 25, 2025Updated last year
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆976Nov 16, 2022Updated 3 years ago
- Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…☆813Dec 11, 2024Updated last year
- Full Database Migration Tool based on Alibaba DataX 3.0☆102Oct 28, 2019Updated 6 years ago
- CloudBase support for one-click deployment and custom development of Discuz! Q, built and deployed with CloudBase Framework☆20Jun 30, 2021Updated 4 years ago
- A data integration framework☆4,108Dec 2, 2025Updated 5 months ago
- DBus☆1,214Dec 6, 2022Updated 3 years ago
- ☆17Dec 7, 2022Updated 3 years ago