duhanmin / datax-on-yarn
Implements a YARN client; datax-on-yarn lets DataX run on the YARN master
☆17Updated last year
Alternatives and similar repositories for datax-on-yarn:
Users that are interested in datax-on-yarn are comparing it to the libraries listed below
- Provides remote multi-language invocation (ThriftServer, HttpServer) and distributed execution (DataX on YARN) for DataX (https://github.com/alibaba/DataX)☆144Updated 2 years ago
- ☆38Updated last year
- A visualization tool for Kudu☆38Updated 5 years ago
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆71Updated 3 years ago
- Extracts column-level data lineage by parsing the syntax tree☆61Updated 2 years ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆79Updated last year
- A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py…☆32Updated 2 years ago
- Flink SQL tutorial☆34Updated 4 months ago
- A Flink-based sqlSubmit program☆145Updated last year
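- Based on Flink 1.9.1: a standalone SDK implementation of the flink-sql-client module that supports remote submission of SQL jobs to a YARN cluster, enabling remote execution of Flink SQL jobs☆48Updated 2 years ago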
- Chinese translation of the official Atlas documentation☆70Updated 5 years ago
- A data synchronization tool built on Spark, written because the existing DataX and Sqoop could not meet our requirements.☆9Updated 6 years ago
- Data lineage for Hive/Sqoop/HBase/Spark, etc.: events are sent to Kafka, then parsed and processed, with Neo4j used to build the lineage graph☆82Updated 3 years ago
- Flink example code☆43Updated 2 years ago
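- Mainly intended as the data bus for a data middle platform or data platform; supports direct, real-time monitoring of data changes in databases such as MySQL, MongoDB, PostgreSQL, Oracle, SQL Server, Db2, and Cassandra.☆62Updated last year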
- Real-time MySQL/Oracle data synchronization based on canal/Kafka Connect, plus the Flink REST API, Flink SQL, and UDFs☆50Updated 2 years ago
- Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer, providing load balancing and rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.☆32Updated 3 years ago
- ☆45Updated 5 years ago
- Apache Hudi Demo☆21Updated this week
- flinksql-platform☆19Updated 4 years ago
- A general-purpose data synchronization microservice based on DataX; a single RESTful interface handles all general data synchronization☆53Updated 2 years ago
- Distributed clustering for DataX, custom DataX plugins, source modifications for job monitoring, and a hook for storing dirty data in a table☆25Updated 4 years ago
- Parsers based on ANTLR4, supporting Spark SQL, TiDB SQL, Flink SQL, and a parser for Spark/Flink jar run commands☆31Updated 2 years ago
- Read and write Solr using Hive☆31Updated 2 years ago
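- Backquarter, an open-source project from Analysys (易观) for transferring records at the ten-billion scale between big data and internet systems☆19Updated 2 years ago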
- A DataX-based scheduling tool for data synchronization jobs; supports custom scheduled jobs, crontab expressions, and adding custom DataX synchronization jobs☆39Updated 6 years ago
- A Kafka connector plugin that accepts MySQL binlog and JSON input and writes to ClickHouse. Continuously updated☆45Updated 4 years ago
- The latest source code is [here](https://github.com/huzekang/springboot-datax.git)☆34Updated last year
- poseidonX is an integrated real-time computing service platform based on JStorm and Flink☆55Updated 6 years ago
- Structured Streaming implemented with SQL☆39Updated 6 years ago