bytedance / clickhouse_hadoopLinks
Import data from clickhouse to hadoop with pure SQL
☆36Updated 6 years ago
Alternatives and similar repositories for clickhouse_hadoop
Users that are interested in clickhouse_hadoop are comparing it to the libraries listed below
Sorting:
- Guardian of Waterdrop and Spark☆30Updated 2 years ago
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆75Updated 4 years ago
- sql code autocomplete☆40Updated 4 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Updated 7 years ago
- flink log connector☆59Updated 9 months ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆107Updated 2 months ago
- A sample of Flink TiDB Realtime Datawarehouse.☆85Updated 4 years ago
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆79Updated last year
- Unified SQL Analytics Engine Based on SparkSQL☆211Updated 2 years ago
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆32Updated 3 years ago
- Flink Sql 教程☆34Updated 7 months ago
- ☆90Updated 2 years ago
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆72Updated 3 years ago
- 基于flink 1.8 源码二次开发,详见MD☆82Updated 5 years ago
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发;通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆11Updated 6 years ago
- java性能采集工具☆51Updated 6 years ago
- DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆29Updated this week
- ☆33Updated 6 years ago
- A library based on delta for Spark and MLSQL☆61Updated 4 years ago
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆241Updated 2 years ago
- 执行Flink SQL 文件的客户端☆25Updated 3 years ago
- facebook presto connectors☆49Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last week
- 基于Yarn的容器调度引擎(container scheduler based on yarn)☆36Updated 9 years ago
- 对yarn的的RM,NM模块代码进行分析☆49Updated 6 years ago
- kudu 学习的一些资料,以及和spark/impala的集成使用☆33Updated 7 years ago
- 为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能☆144Updated this week
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Updated 3 years ago
- ACL Management for Apache Spark SQL with Apache Ranger☆17Updated 5 years ago
- 录制Spak视频课程讲解涉及编写的源代码 https://edu.hellobi.com/course/107/overview☆13Updated 6 years ago