StarRocks / DataXLinks
☆17Updated 3 years ago
Alternatives and similar repositories for DataX
Users that are interested in DataX are comparing it to the libraries listed below
Sorting:
- Apache StreamPark quickstart☆74Updated 7 months ago
- Doris表和字段血缘项目☆82Updated last year
- datax数据同步elasticsearch的reader和writer插件,支持一对多的扁平数据转换成es的嵌套对象,也支持嵌套对象的读取和ognl表达式过滤,理论上可以无限嵌套。☆88Updated last month
- Apache DolphinScheduler website☆139Updated this week
- flink sql connector clickhouse zeppelin☆79Updated 3 years ago
- DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。☆138Updated 3 years ago
- ☆105Updated 4 months ago
- DorisDB SQL解析器Java实现;Clickhouse SQL解析器Java实现☆97Updated 3 years ago
- 数据血缘,Hive/Sqoop/HBase/Spark等,发送到kafka后,解析处理使用neo4j生成血缘☆82Updated 4 years ago
- 记录HBase版本API的变迁Demo☆33Updated 6 years ago
- Using Flink SQL to build ETL job☆205Updated last year
- flink 集成CDH5的自定义paracels☆69Updated 3 years ago
- flink-parcel compiler tool☆48Updated 5 years ago
- Cluster manager for Apache Doris☆185Updated last year
- 基于 Flink 的 sqlSubmit 程序☆147Updated last year
- 从本地IDEA提交Flink/Spark任务到Yarn/k8s集群☆165Updated 3 years ago
- 分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【大数据技术与应用实战】,一起成长。☆263Updated last year
- FlinkSQL数据脱敏和行级权限解决方案及源码,支持面向用户级别的数据脱敏和行级数据访问控制,即特定用户只能访问到脱敏后的数据或授权过的行。此方案是实时领域Flink的解决方案,类似于离线数仓Hive Ranger中的Row-level Filter和Column Mas…☆142Updated last year
- A fast MPP database for all modern analytics on big data. Powered by Apache Doris(Incubating)☆54Updated 3 years ago
- Asynchronous flink connector based on the Lettuce, supporting sql join and sink, query caching and debugging.☆250Updated 4 months ago
- ☆218Updated last week
- DataSphereStudio documents.☆122Updated 8 months ago
- Demos for Flink connectors on Ververica Platform (VVP)☆42Updated 2 months ago
- 如果你在从事大数据BI的工作,想对比一下MySQL、GreenPlum、Elasticsearch、Hive、Spark SQL、Presto、Impala、Drill、HAWQ、Druid、Pinot、Kylin、ClickHouse、Kudu等不同实现方案之间的表现,…☆285Updated 7 years ago
- hudi 中文文档☆37Updated 5 years ago
- Bireme is an incremental synchronization tool for the Greenplum / HashData data warehouse☆138Updated 3 years ago
- 通过语法树解析获取字段级血缘数据☆61Updated 2 years ago
- DataX分布式集群化、自定义DataX插件、源码修改任务监控以及脏数据存表Hook☆26Updated 4 years ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆80Updated last year
- Spark、Flink等离线任务的调度以及实时任务的监控☆304Updated last year