sensorsdata / ext-processor-sample
Data preprocessing module
☆11 Updated 6 years ago
Alternatives and similar repositories for ext-processor-sample:
Users who are interested in ext-processor-sample are comparing it to the libraries listed below.
- 海狗 (Haigou), a multidimensional online analysis system ☆73 Updated 10 years ago
- Ansible playbooks to help deploy Apache Hadoop, Spark, Storm, ZooKeeper, Elasticsearch, Azkaban, Flume, HBase, Kafka, Kibana, and Logstash ☆10 Updated 8 years ago
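- Real-time data analysis platform ☆41 Updated 11 years ago
- A simple and efficient URL keyword extraction tool ☆15 Updated 6 years ago
- Log collection tool ☆21 Updated 7 years ago
- Kafka connector plugin that accepts MySQL binlog and JSON input and writes to ClickHouse. Continuously updated. ☆45 Updated 4 years ago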
- Some customized Flume components. ☆31 Updated 9 years ago
- An experiment in hot-word recommendation using OpenResty & Redis ☆19 Updated 8 years ago
- Java performance collection tool ☆51 Updated 6 years ago
- Container scheduling engine based on YARN ☆36 Updated 8 years ago
- Storm, Kafka, and HDFS examples ☆21 Updated 8 years ago
- 易观 (Analysys) KongPlus ☆19 Updated 2 years ago
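- A MySQL slave implementation based on canal ☆12 Updated 11 years ago
- Heterogeneous storage data migration ☆30 Updated 7 years ago
- MySQL data migration tool. Supports specifying table and column names, and uses multithreading plus multiprocessing. Ensures high availability and data consistency. ☆20 Updated 10 years ago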
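- meerkat is a foundational component for service monitoring and service degradation. It mainly monitors the success rate, response time, and QPS of calls to external interfaces. When the success rate drops below a preset threshold, it automatically cuts off calls to the external interface, and it automatically resumes requests once the interface's success rate recovers. ☆51 Updated 7 years ago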
- Caravel is a data exploration platform designed to be visual, intuitive, and interactive ☆20 Updated 8 years ago
- The sixth Hangzhou Spark & Flink Meetup ☆30 Updated 6 years ago
- Distributed task scheduling system ☆15 Updated 3 years ago
- Distributed Configuration Management Platform ☆16 Updated 8 years ago
- 延云 YDB, a real-time solution for big data at the hundred-billion scale ☆30 Updated 8 years ago
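- Database access middleware that provides unified standard SQL queries; the underlying store can be any of several databases, including MySQL, Elasticsearch, Kylin, and Presto. ☆15 Updated 6 years ago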
- [Cloudframeworks] SMACK Big Data Architecture - user guide ☆70 Updated 7 years ago
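- ZkConfig is a configuration service toolkit developed for ZooKeeper. It integrates well with existing Java systems and can also serve non-Java systems by running as a standalone process. It provides a plugin for Spring integration and uses annotations to mark the in-memory data objects that need dynamic updates. ZkConfig addresses configuration files in a system cluster… ☆24 Updated 9 years ago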
- A lightweight distributed task Executor system ☆39 Updated 8 years ago
- Computer Foundations Practices ☆19 Updated 3 years ago
- Bloom filter ☆54 Updated 5 years ago
- ADFS (Ali Distributed File System) is an evolved version of Hadoop that delivers high availability, quick restart, and other features… ☆114 Updated last year
- timetunnel is developed to transfer data in real time; it is used to collect log data and to sync database data at Taobao ☆39 Updated 4 years ago
- ☆22 Updated 2 years ago