mattshma / bigdata
hadoop,hbase,storm,spark,etc..
☆160Updated 5 years ago
Alternatives and similar repositories for bigdata:
Users that are interested in bigdata are comparing it to the libraries listed below
- ☆133Updated 7 years ago
- azkaban小助手,增加任务web配置、远程脚本调用、报警扩展、跨项目依赖等功能。☆119Updated 7 years ago
- spark实例代码☆78Updated 7 years ago
- mysql数据实时增量导入hive☆87Updated 7 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆128Updated 6 years ago
- My Blog☆76Updated 6 years ago
- ☆235Updated 2 years ago
- ☆121Updated last month
- ☆77Updated 6 years ago
- Spark源码剖析☆87Updated 7 years ago
- spark性能调优总结 spark config and tuning☆121Updated 6 years ago
- Spark 脚手架工程,标准化 spark 开发、部署、测试流程。☆93Updated 3 months ago
- ☆88Updated last year
- flink技术学习笔记分享☆84Updated 5 years ago
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆241Updated 2 years ago
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆181Updated 3 years ago
- ☆53Updated 6 years ago
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆45Updated 7 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Updated 8 years ago
- hbase-tools try easy to use and test the hbase,☆46Updated 10 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆135Updated 5 months ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆154Updated last year
- ☆137Updated 6 years ago
- Spark源码分析,主要包含SparkContext源码、Executor进程启动、Stage划分、Task执行和Spark2.0的新特性☆82Updated 5 years ago
- ☆42Updated 5 years ago
- spark example code, has some production practice.☆175Updated 8 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Updated 7 years ago
- Flume NG Canal source☆56Updated 6 years ago
- Flink 官方文档中文翻译项目☆380Updated last month