京东RTF实时数据湖,是一个从底层重新构建的系统,解决了数据的接入、解析及清洗等ETL 过程,同时解决了传统离线模式达不到的实时性和流式实时数据做不到的数据清洗、还原,是一套大数据领域改革性的实时数据方案。RTF可以直接查询最新状态的数据,并且无需去重,可以让数据分析人员即使不了解flink或spark等实时计算框架,也能够获取实时数据进行分析。
☆129Sep 29, 2023Updated 2 years ago
Alternatives and similar repositories for rtf-lake
Users that are interested in rtf-lake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- flink-sql 在 flink 上运行 sql 和 构建数据流的平台 基于 apache flink 1.10.0☆113Jun 21, 2022Updated 3 years ago
- Spark、Flink等离线任务的调度以及实时任务的监控☆304Nov 13, 2025Updated 7 months ago
- Flink 案例开发数据清洗、数据报表☆58Sep 13, 2025Updated 9 months ago
- FlinkTutorial 专注大数据Flink流试处理技术。从基础入门、概念、原理、实战、性能调优、源码解析等内容,使用Java开发,同时含有Scala部分核心代码。欢迎关注我的博客及github。☆70Jun 21, 2022Updated 3 years ago
- Using Flink SQL to build ETL job☆206Sep 29, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- dw etl 工具 mysql 增量、全量抽取 to hive. 合并 hive 数据表, 等数据平台清洗工具☆10Dec 21, 2016Updated 9 years ago
- web-audio-api 可视化音乐播放器☆14May 13, 2020Updated 6 years ago
- LarkMidTable 是一站式开源的数据中台,实现中台的 基础建设,数据治理,数据开发,监控告警,数据服务,数据的可视化,实现高效赋能数据前台并提供数据服务的产品。☆2,052Aug 20, 2023Updated 2 years ago
- ☆12Mar 10, 2019Updated 7 years ago
- 基于Flink流处理的动态实时亿级全端用户画像系统☆486Dec 14, 2022Updated 3 years ago
- Platform for Flink☆280Jan 3, 2023Updated 3 years ago
- ☆16Nov 16, 2022Updated 3 years ago
- It is a kind of big data computing platform which is driven by the Flink SQL. In particular, it provides the SQL programming.☆20Jan 5, 2023Updated 3 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Jul 1, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆80Mar 21, 2024Updated 2 years ago
- 基于 Flink 的 sqlSubmit 程序☆144May 12, 2026Updated last month
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆64Dec 5, 2023Updated 2 years ago
- Use SQL to query Elasticsearch☆18Oct 11, 2016Updated 9 years ago
- Detects shot boundaries from news with K-Means. Using Bhattacharya Coefficient for distance.☆10Jun 1, 2017Updated 9 years ago
- 数据清洗模板程序(spring batch)☆10Jul 18, 2016Updated 9 years ago
- Implement a complete data warehouse etl using spark SQL☆14Sep 8, 2022Updated 3 years ago
- 中华人民共和国行政区划:省级(省份直辖市自治区)、 地级(城市)、 县级(区县)、 乡级(乡镇街道)、 村级(村委会居委会) ,中国省市区镇村三级四级五级联动地址数据。☆39May 15, 2018Updated 8 years ago
- 云雀 是一款数据集成工具,实现异构数据源的整合,帮助企业构建数据仓库、数据湖 等应用架构☆18May 27, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Big data smart alarm by sql☆12May 11, 2021Updated 5 years ago
- 给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群☆290Dec 16, 2022Updated 3 years ago
- 基于spring mvc+redis+logback+elk的日志demo☆12Feb 23, 2017Updated 9 years ago
- 从本地IDEA提交Flink/Spark任务到Yarn/k8s集群☆167Oct 18, 2021Updated 4 years ago
- 基于Vue + Web Audio API + Canvas 制作的可视化音乐播放器☆18Mar 4, 2023Updated 3 years ago
- 数据建设与大数据技术知识体系,包含hadoop、hive、spark、flink主流框架和系列框架,数据中台、数据湖、数据治理、数仓建设、数据化转型等☆455Aug 8, 2025Updated 10 months ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 6 years ago
- Bireme is an incremental synchronization tool for the Greenplum / HashData data warehouse☆138Feb 23, 2022Updated 4 years ago
- DataLink是一个满足各种异构数据源之间的实时增量同步、离线全量同步,分布式、可扩展的数据交换平台。☆1,122Dec 6, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Apache Spark - A unified analytics engine for large-scale data processing☆16Jul 24, 2023Updated 2 years ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆115Apr 25, 2025Updated last year
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆976Nov 16, 2022Updated 3 years ago
- Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…☆814Dec 11, 2024Updated last year
- Full Database Migration Tool based on Alibaba DataX 3.0☆103Oct 28, 2019Updated 6 years ago
- API网关应用,统一提供WEB服务,实现标准化请求、接口协议转换、多版本管理、登录鉴权、流控、超时控制、调用监控、服务治理、接口测试工具等功能,减少服务端同学的重复开发工作,完成API的统一管理。☆22Dec 2, 2021Updated 4 years ago
- A data integration framework☆4,105Dec 2, 2025Updated 6 months ago