linyiqun / yarn-jobhistory-crawlerLinks
JobHistory上的job信息爬取工具
☆35Updated 10 years ago
Alternatives and similar repositories for yarn-jobhistory-crawler
Users that are interested in yarn-jobhistory-crawler are comparing it to the libraries listed below
Sorting:
- mysql数据实时增量导入hive☆87Updated 8 years ago
- azkaban小助手,增加任务web配置、远程脚本调用、报警扩展、跨项目依赖等功能。☆117Updated 8 years ago
- Spark Streaming监控平台,支持任务部署与告警、自启动☆129Updated 7 years ago
- My Blog☆76Updated 7 years ago
- Spark 脚手架工程,标准化 spark 开发、部署、测试流程。☆94Updated last year
- Stream computing platform for bigdata☆406Updated last year
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- Spark源码剖析☆87Updated 8 years ago
- An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)☆381Updated last year
- ☆236Updated 3 years ago
- presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。☆242Updated 2 years ago
- ☆53Updated 7 years ago
- sql解析工具。主要解析hive sql、spark sql、presto sql。从sql中解析出输入表、输出表以及字段等信息☆97Updated 2 years ago
- 剥离的模块,用于查看Spark SQL生成的语法树☆92Updated 6 years ago
- spark-scala-maven☆59Updated 6 years ago
- ☆118Updated 2 years ago
- sql实现Structured Streaming☆39Updated 6 years ago
- spark实例代码☆78Updated 8 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Updated 8 years ago
- ☆45Updated 5 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆133Updated last month
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Updated 9 years ago
- Flink 菜鸟公众号代码地址☆64Updated 11 months ago
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆44Updated 8 years ago
- ☆133Updated 8 years ago
- ☆91Updated 6 years ago
- Test code for apache calcite☆214Updated 3 years ago
- Spark-2.3.1源码解读☆201Updated 2 years ago
- 记录Spark、Flink研究经验☆26Updated 6 years ago
- 给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群☆287Updated 2 years ago