lw-lin / CoolplaySpark
Coolplay Spark: Spark source code analysis, Spark libraries, and more
☆3,488 Updated 3 years ago
Alternatives and similar repositories for CoolplaySpark
Users that are interested in CoolplaySpark are comparing it to the libraries listed below
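- Flink video courses in Chinese (continuously updated...) ☆4,594 Updated 4 years ago
- Notes talking about the design and implementation of Apache Spark ☆5,323 Updated last year
- An extension of open-source Flink's real-time SQL; it mainly implements joins between streams and dimension tables and supports all native Flink SQL syntax ☆2,047 Updated last year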
- Digging holes and filling them in ☆691 Updated 8 years ago
- Learning Apache Spark, including code and data. Most parts can run locally. ☆602 Updated 3 years ago
- A collection of test cases and related materials gathered while using Scala and Spark ☆1,086 Updated 6 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI. ☆1,839 Updated last year
- Chinese edition of the official Apache Spark documentation ☆1,185 Updated last year
- Apache Kylin ☆3,707 Updated last month
- Analysis of Spark ML algorithm principles and of their concrete source code implementations ☆1,958 Updated 6 years ago
- Usage of the various Hadoop components, continuously updated ☆902 Updated 2 years ago
- Flink Forward China 2018 Slides ☆577 Updated 6 years ago
- A data integration framework ☆4,055 Updated 2 months ago
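- Regularly updated documentation for commonly used big data components in the Hadoop ecosystem. Focus, in order: Flink, Solr, Sparksql, ES, Scala, Kafka, Hbase/phoenix, Redis, Kerberos (the project includes Hadoop mind maps, Evernote notes, simple Scala demos … ☆924 Updated 2 years ago
- [Big data interview questions] A collection of big data interview questions gathered from the web, together with the author's own summarized answers. Currently covers interview knowledge for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks ☆1,616 Updated 3 years ago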
- A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources ☆2,057 Updated 2 years ago
- Papers and reading material related to streaming systems ☆733 Updated 3 years ago
- Studying the Spark source code ☆303 Updated 9 years ago
- Chinese translation project for the official Apache Flink documentation ☆488 Updated 2 years ago
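- Apache Hive ☆5,712 Updated this week
- A Flink-based web platform for real-time stream computing ☆1,836 Updated 9 months ago
- Big data interview questions covering hadoop, hbase, hive, spark, storm, zookeeper, kafka, flume, logstash, redis, ELK, ETL, algorithms, and more; continuously updated ☆442 Updated 6 years ago
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters. ☆2,224 Updated this week
- A compilation of Apache Hudi resources ☆552 Updated last week
- A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more. ☆1,618 Updated 2 weeks ago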
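- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l… ☆2,553 Updated 7 months ago
- A real-time product recommendation system built on Flink. Flink computes product popularity and stores it in a Redis cache, analyzes log data, and writes profile tags and real-time records to HBase. When a user issues a recommendation request, the popularity ranking is re-sorted according to the user profile, and two recommendation modules (collaborative filtering and tag-based) attach related products to each product in the newly generated list before the new user list is returned. ☆4,388 Updated last year
- Apache HBase ☆5,346 Updated this week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,188 Updated this week
- Chinese translation project for the official Flink documentation ☆382 Updated 6 months ago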