melin / datatunnel
DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。
☆18Updated this week
Alternatives and similar repositories for datatunnel:
Users that are interested in datatunnel are comparing it to the libraries listed below
- Streaming data analysis platform based on Flink <至流云-超轻量级流式计算平台/实时同步/数据同步>☆19Updated this week
- Official Product Plugins For TIS☆13Updated this week
- java性能采集工具☆51Updated 6 years ago
- ☆38Updated last week
- an open source dataworks platform☆21Updated 3 years ago
- 执行Flink SQL 文件的客户端☆26Updated 3 years ago
- Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.☆16Updated this week
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 11 months ago
- sql code autocomplete☆40Updated 4 years ago
- SQL for Redis☆12Updated 2 years ago
- Flink China Doc & Blog | Markdown Support & Auto Deploy☆13Updated 4 years ago
- Flink Sql 教程☆34Updated 2 months ago
- Apache Flink Connectors for OceanBase.☆21Updated last week
- It is a kind of big data computing platform which is driven by the Flink SQL. In particular, it provides the SQL programming.☆19Updated 2 years ago
- Alerting and monitoring tool for Apache Spark☆23Updated 2 years ago
- Presto connector for Apache Paimon.☆11Updated 3 weeks ago
- Java client for managing Apache Flink via REST API☆56Updated last month
- kafka connector 插件,支持输入 mysql binlog 和 json 格式写入ClickHouse。持续更新☆45Updated 4 years ago
- streamer 实时计算引擎☆19Updated last year
- Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash☆10Updated 7 years ago
- 易观开源大数据互联网百亿级记录互传Backquarter项目☆19Updated 2 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Updated 7 years ago
- Aloha: a distributed task scheduling and management framework☆64Updated 2 years ago
- ☆15Updated 2 years ago
- 反应式 海量数据治理平台☆39Updated 4 years ago
- akkaflow是一个基于akka架构上构建的分布式高可用DAG工作流调度工具,可以把子节点分配在集群机器上并行执行,高效利用集群资源。☆107Updated 5 years ago
- dag job runner framework☆22Updated 3 months ago
- Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling☆23Updated 6 years ago
- 智能数据探索服务(Intelligent Data Exploration Service),一站式Data + AI数据解决方案!☆36Updated last year
- 基于Yarn的容器调度引擎(container scheduler based on yarn)☆36Updated 8 years ago