rison168 / bigdata-iceberg
Iceberg开发指南,集成数据湖Iceberg在Spark、Flink引擎的等使用示例
☆13Updated 2 years ago
Alternatives and similar repositories for bigdata-iceberg:
Users that are interested in bigdata-iceberg are comparing it to the libraries listed below
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆78Updated last year
- Make data connection easier☆22Updated 2 years ago
- dataService platform is a low-code platform, which only needs to write SQL to realize the development of API services, solve the unificat…☆111Updated last year
- Apache Hudi Demo☆21Updated 9 months ago
- Using Flink SQL to build ETL job☆202Updated last year
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆62Updated last year
- ☆38Updated last year
- 基于 Flink 的 sqlSubmit 程序☆145Updated last year
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆71Updated 3 years ago
- 基于Apache-bahir-kudu-connector的flink-connector-kudu,支持Flink1.11.x DynamicTableSource/Sink,支持Range分区等☆46Updated last year
- flink iceberg integration tests, jobs running on yarn.☆38Updated 3 years ago
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆105Updated 3 months ago
- DataSphereStudio documents.☆115Updated 2 months ago
- 为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能☆144Updated last year
- Flink Sql 教程☆34Updated 3 months ago
- A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py…☆32Updated 2 years ago
- HiveReader for alibaba DataX☆17Updated last year
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆32Updated 2 years ago
- Apache Ambari Web 中文汉化 2.7.x版本直接修改☆39Updated 2 years ago
- ☆118Updated last year
- 易观开源大数据互联网百亿级记录互传Backquarter项目☆19Updated 2 years ago
- Apache StreamPark quickstart☆70Updated 2 months ago
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Updated 4 years ago
- 一个实时数仓项目,从0到1搭建实时数仓☆55Updated 3 years ago
- flink sql☆11Updated 2 years ago
- 数据血缘,Hive/Sqoop/HBase/Spark等,发送到kafka后,解析处理使用neo4j生成血缘☆81Updated 3 years ago
- 基于flink1.9.1,flink-sql-client模块SDK单独实现,支持Yarn集群的远程SQL任务发布,可以支撑flink sql任务的远程化执行☆48Updated 2 years ago
- Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.☆139Updated 6 months ago
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发;通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆11Updated 5 years ago
- 数据采集平台zdh,etl 处理服务☆71Updated this week