Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to consume kafka and assemble the data into Greenplum, and more data sources and target sources will be added in the future.
☆80Mar 21, 2024Updated last year
Alternatives and similar repositories for mriya
Users that are interested in mriya are comparing it to the libraries listed below
Sorting:
- 蜜蜂牧场是一个数据采集清洗工具,也是一个ETL工具,同时也是一套脚本语言。☆14Jul 1, 2018Updated 7 years ago
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆63Dec 5, 2023Updated 2 years ago
- Flink 实时ETL案例☆46Sep 8, 2022Updated 3 years ago
- Stream computing platform for bigdata☆408Apr 24, 2024Updated last year
- flink-sql 在 flink 上运行 sql 和 构建数据流的平台 基于 apache flink 1.10.0☆113Jun 21, 2022Updated 3 years ago
- Using Flink SQL to build ETL job☆205Sep 29, 2023Updated 2 years ago
- 【bigdata】spirngboot+spark 脚手架+相关实例☆22Jun 21, 2022Updated 3 years ago
- 基于ansible的Greenplum集群多主机节点一键安装工具//dbswitch.gitee.io/docs-site/☆15Jul 4, 2021Updated 4 years ago
- ☆17Oct 18, 2019Updated 6 years ago
- 各种安全相关思维导图整理收集☆11Sep 7, 2015Updated 10 years ago
- A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py…☆33May 21, 2022Updated 3 years ago
- 基于flink的营销系统☆14Jun 9, 2022Updated 3 years ago
- 执行Flink SQL 文件的客户端☆24Oct 19, 2021Updated 4 years ago
- Image to run Greenplum☆27Jan 30, 2024Updated 2 years ago
- 主要介绍作者使用过的Greenplum技术,欢迎大家交流☆232Feb 5, 2026Updated last month
- 基于flink1.9.1,flink-sql-client模块SDK单独实现,支持Yarn集群的远程SQL任务发布,可以支撑flink sql任务的远程化执行☆47Jan 26, 2026Updated last month
- A data integration framework☆4,109Dec 2, 2025Updated 3 months ago
- tensorflow serving and deep model online https://dataxujing.github.io/tensorflow-serving-Wechat/?transition=convex#/☆19Nov 23, 2018Updated 7 years ago
- Bireme is an incremental synchronization tool for the Greenplum / HashData data warehouse☆138Feb 23, 2022Updated 4 years ago
- Flink 菜鸟公众号代码地址☆64Dec 2, 2024Updated last year
- MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数据同步中间件。帮助用户可靠、快速、稳定的对海量数据进行准实时增量同步或者定时全量同步,主要定位是为实时交易系统服务,亦可用于大数据的数据同步(ETL领域)。☆87Aug 13, 2025Updated 7 months ago
- Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, res…☆813Dec 11, 2024Updated last year
- 连接vertica,oracle,mysql,redis,mongodb的数据访问平台☆10Aug 13, 2019Updated 6 years ago
- Mirror of App Suite middleware code☆14Nov 14, 2022Updated 3 years ago
- Example of using greenplum-spark connector☆20Feb 5, 2019Updated 7 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Jun 8, 2016Updated 9 years ago
- Spark、Flink等离线任务的调度以及实时任务的监控☆306Nov 13, 2025Updated 4 months ago
- 给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群☆289Dec 16, 2022Updated 3 years ago
- SQL语法词法分析 SQL表级血缘 SQL字段级别血缘 SQL函数血缘 SQL编译器☆17Nov 1, 2022Updated 3 years ago
- mysql数据实时增量导入hive☆87Jun 15, 2017Updated 8 years ago
- Instructions for getting started with Ververica Platform on minikube.☆95Jul 9, 2025Updated 8 months ago
- A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources☆2,051Oct 25, 2022Updated 3 years ago
- 分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【Hello大数据】,一起成长。☆261Feb 21, 2024Updated 2 years ago
- 实现Local、FTP、HDFS文件系统的统一操作。☆13May 12, 2016Updated 9 years ago
- 2019 年开源年度报告☆11Jan 7, 2020Updated 6 years ago
- A database schema conversion tool☆28Jun 29, 2020Updated 5 years ago
- AMS实时推荐系统☆17Nov 4, 2022Updated 3 years ago
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Sep 13, 2020Updated 5 years ago
- 北极星数据管理中台☆14Oct 26, 2022Updated 3 years ago