DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
☆139Mar 24, 2022Updated 3 years ago
Alternatives and similar repositories for DataX
Users that are interested in DataX are comparing it to the libraries listed below
Sorting:
- Bireme is an incremental synchronization tool for the Greenplum / HashData data warehouse☆138Feb 23, 2022Updated 4 years ago
- 围绕 PostgreSQL Greenplum ,实现易用的数据的互迁功能项目☆551Nov 16, 2021Updated 4 years ago
- TPC-H like benchmark for PostgreSQL☆16Feb 14, 2016Updated 10 years ago
- 主要介绍作者使用过的Greenplum技术,欢迎大家交流☆232Feb 5, 2026Updated last month
- Sync MySQL data into elasticsearch or postgresql☆45Dec 3, 2021Updated 4 years ago
- Help you migrate from Greenplum(GPDB) to Cloudberry(CBDB)☆23Jan 27, 2026Updated last month
- datax数据同步elasticsearch的reader和writer插件,支持一对多的扁平数据转换成es的嵌套对象,也支持嵌套对象的读取和ognl表达式过滤,理论上可以无限嵌套。☆91Jul 12, 2025Updated 8 months ago
- Some simple tools for greenplum db, tools developed include UDF, shell, perl etc.☆16Feb 12, 2026Updated last month
- GP专用客户端☆19Aug 29, 2024Updated last year
- DataX是阿里云DataWorks数据集成的开源版本。☆17,143Jul 1, 2025Updated 8 months ago
- 基于DataX的数据同步任务调度工具,支持自定义定时任务,支持crontab表达式,支持自定义添加DataX数据同步任务☆38Feb 15, 2019Updated 7 years ago
- oracle数据同步到Greenplum的shell脚本☆11May 13, 2019Updated 6 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Jul 9, 2025Updated 8 months ago
- DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、…☆5,988Jun 2, 2024Updated last year
- A library based on Hudi for Spark.☆10Nov 30, 2021Updated 4 years ago
- RoaringBitmap extension for greenplum-db☆114Apr 30, 2021Updated 4 years ago
- 基于ansible的Greenplum集群多主机节点一键安装工具//dbswitch.gitee.io/docs-site/☆15Jul 4, 2021Updated 4 years ago
- Consume debezium events to databend☆21Apr 7, 2024Updated last year
- DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。☆23Nov 5, 2021Updated 4 years ago
- Greenplum(v5,v6) exporter for Prometheus☆60May 13, 2024Updated last year
- Image to run Greenplum☆27Jan 30, 2024Updated 2 years ago
- 基于kettle8.0的作业/转换管理框架☆34Mar 14, 2018Updated 8 years ago
- 列式数据库infobright安装、使用及备份☆16Feb 7, 2017Updated 9 years ago
- Kettle plugin that provides support for interacting within many "big data" projects including Hadoop, Hive, HBase, Cassandra, MongoDB, an…☆241Updated this week
- 数仓实时项目☆10May 9, 2019Updated 6 years ago
- Example of using greenplum-spark connector☆20Feb 5, 2019Updated 7 years ago
- A data integration framework☆4,109Dec 2, 2025Updated 3 months ago
- Greenplum System Catalog Reference☆11Oct 28, 2021Updated 4 years ago
- Undo storage implementation☆16Dec 8, 2020Updated 5 years ago
- Full Database Migration Tool based on Alibaba DataX 3.0☆101Oct 28, 2019Updated 6 years ago
- 一个基于Spring Boot & MyBatis的脚手架,代码生成易配置(没有集成mybatis generator,纯freemarker模板),快速开发中小型项目☆29Jun 17, 2022Updated 3 years ago
- [译] zeppelin 中文文档☆14Jul 21, 2023Updated 2 years ago
- Citus shard migration tool☆27May 30, 2021Updated 4 years ago
- Zeus is an open-source, analytical engine for big data hold in data lake; it was designed to provide OLAP (Online Analytical Processing) …☆25Nov 2, 2021Updated 4 years ago
- DBus☆1,212Dec 6, 2022Updated 3 years ago
- 将kettle集成值web应用中,不再需打开kettle窗口运行,采用springmvc+beetlsql框架实现,并通过quartz自动任务进行数据抽取。配置简单方便。(之前需要kettle打开其运行环境,并配置数据库连接的相关信息)☆61Apr 27, 2018Updated 7 years ago
- 基于flinkx的分布式数据中台产品☆10Sep 11, 2020Updated 5 years ago
- Command-line tool for interacting with pgdash.io☆35Jan 18, 2026Updated 2 months ago
- datax web。datax中的web配置界面没有集成在一起开源出来,此为web端配置项目。☆100Mar 19, 2019Updated 7 years ago