dongkelun / hudi-demo
Apache Hudi Demo
☆22 · Apr 24, 2025 · Updated 9 months ago
Alternatives and similar repositories for hudi-demo
Users that are interested in hudi-demo are comparing it to the libraries listed below
- An easy-to-use, scalable spark streaming ETL tool and sdk ☆13 · Aug 14, 2017 · Updated 8 years ago
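- Migration tool for one-way migration from Oracle, MySQL, and SqlServer to PostgreSQL, and two-way migration between PostgreSQL and big data platforms such as Hive, Hbase, and Impala. ☆10 · Dec 3, 2014 · Updated 11 years ago
- A storage format 5~100 times faster than Spark-Parquet ☆12 · Feb 22, 2016 · Updated 9 years ago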
- Distributed SQL-based real-time streaming computation framework on Apache Storm, Spark ☆12 · Mar 14, 2016 · Updated 9 years ago
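- A repository started while watching the Atguigu (尚硅谷) Flink hands-on videos, recording the source code and some required data files; discussion is welcome ☆16 · Mar 1, 2021 · Updated 4 years ago
- Data exchange middleware based on ActiveMQ ☆14 · Aug 17, 2014 · Updated 11 years ago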
- MySQL to NoSQL real time dataflow ☆18 · Oct 14, 2017 · Updated 8 years ago
- Apache Flink Hairless Notes ☆13 · Jun 21, 2022 · Updated 3 years ago
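- A hands-on movie analysis project based on SparkSQL ☆40 · Nov 21, 2020 · Updated 5 years ago
- Integration of spring + spark streaming + kafka version 10, with handling of exceptions and issues ☆17 · Jul 21, 2017 · Updated 8 years ago
- A general processing framework for personalized recommendation algorithms, based on Mahout and Lucene ☆18 · May 25, 2015 · Updated 10 years ago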
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 · May 7, 2018 · Updated 7 years ago
- Real-time analytics in Apache Flume ☆51 · Feb 2, 2016 · Updated 10 years ago
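- Text deduplication algorithm, developed from research on deduplicating news in recommender systems. It uses Yahoo's Near-duplicates and shingling algorithm; the server is implemented in C and the client in Java, communicating through the thrift framework. For extensibility, deduplication can run on the server, and the server also exposes a computation interface so clients can extend it themselves ☆24 · Feb 25, 2014 · Updated 11 years ago
- Parses Mysql binlog logs and sends them to Kafka ☆23 · Nov 25, 2016 · Updated 9 years ago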
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka ☆29 · Jun 8, 2016 · Updated 9 years ago
- Demo recording API changes across HBase versions ☆32 · Apr 27, 2019 · Updated 6 years ago
- ☆40 · Aug 3, 2015 · Updated 10 years ago
- This project has moved to https://github.com/cocolian/cocolian-nlp ☆34 · Jun 8, 2014 · Updated 11 years ago
- A batch-processing system based on Spring Boot and Spring Batch. ☆10 · Sep 10, 2018 · Updated 7 years ago
- Web version of Kettle ☆22 · Aug 2, 2016 · Updated 9 years ago
- Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes. ☆37 · Aug 9, 2023 · Updated 2 years ago
- ☆11 · Sep 1, 2022 · Updated 3 years ago
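- Based on the derby source code, with the SQL parsing functionality extracted by trimming it down, and a cross-database join query tool built on top. The tool splits a join query into multiple single-table queries and then joins the single-table result sets, moving the database's join functionality up into the tool. See the wiki (readme) for details ☆10 · Feb 14, 2017 · Updated 9 years ago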
- Spark projects. Learning book "Machine Learning with Spark" ☆10 · Jun 3, 2017 · Updated 8 years ago
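- flink 10 self-study notes and code ☆14 · Jun 29, 2022 · Updated 3 years ago
- Converts json or SQL into flink or spark streaming/batch jobs ☆12 · Jun 21, 2022 · Updated 3 years ago
- A curated collection of security-related mind maps ☆11 · Sep 7, 2015 · Updated 10 years ago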
- This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin. ☆10 · Mar 28, 2019 · Updated 6 years ago
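- zdh series: a java-based business risk control engine ☆13 · Jan 24, 2026 · Updated 3 weeks ago
- Map/Reduce usage examples in hadoop: input (DBInputFormat) and output (DBOutputFormat) with MySql database tables, log analysis (Grep), word sorting (Sort)... basic HBase operations (create, delete, query, update), bulk-importing data into HBase tables with Map/Reduce..… ☆14 · Apr 6, 2013 · Updated 12 years ago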
- REST job server for Apache Spark ☆44 · May 23, 2025 · Updated 8 months ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆40 · Jun 29, 2017 · Updated 8 years ago
- A simple user information API implemented with Spring+Mybatis+Jetty. ☆11 · Mar 13, 2015 · Updated 10 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry ☆11 · Aug 17, 2016 · Updated 9 years ago
- LightRAG with Neo4j Example Project ☆17 · May 19, 2025 · Updated 8 months ago
- Zookeeper Monitoring Extension for AppDynamics ☆10 · Sep 29, 2021 · Updated 4 years ago
- Deploy Apache Doris (incubating) FE & BE using shell scripts ☆11 · Jul 8, 2019 · Updated 6 years ago
- Data exchange ☆10 · Jun 5, 2024 · Updated last year