BZCareer / hadoop-ansible
This big data distribution contains Ansible provisioning for: Apache Hadoop, Apache Spark, Apache Hive, Apache Pig, Apache Storm, Apache Zookeeper, Apache Kafka, Apache Cassandra, Elasticsearch, Kibana, Logstash, Apache HBase, Apache Zeppelin, and Apache Flink
☆14Updated 7 years ago
Related projects
Alternatives and complementary repositories for hadoop-ansible
- Dockerfiles and Docker Compose for HDP 2.6 with Blueprints☆23Updated 6 years ago
- Implement a complete data warehouse ETL using Spark SQL☆13Updated 2 years ago
- Ansible playbooks to help deploy Apache Hadoop, Spark, Storm, Zookeeper, Elasticsearch, Azkaban, Flume, HBase, Kafka, Kibana, Logstash☆10Updated 7 years ago
- ☆30Updated last year
- A data-centric integration platform☆48Updated 3 years ago
- Reactive massive-data governance platform☆38Updated 4 years ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH. To start, it is best to check the documentation on https://github.co…☆55Updated 6 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Updated 7 years ago
- Import data from ClickHouse to Hadoop with pure SQL☆36Updated 5 years ago
- Flink endpoint for open world☆25Updated last year
- CDC Kafka Connect source for Oracle Databases leveraging Oracle Logminer☆31Updated this week
- The Ansible playbooks for CDH6☆18Updated 3 years ago
- Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)☆22Updated last year
- Streaming using Flink to connect Kafka and Elasticsearch☆29Updated 8 years ago
- datacollector-oss☆90Updated 3 months ago
- Example of using greenplum-spark connector☆19Updated 5 years ago
- Flink SQL in Practice - Chinese blog column☆16Updated 2 years ago
- Dockerized HDP Cluster☆84Updated 6 years ago
- An open source dataworks platform☆21Updated 3 years ago
- Ambari stack service for easily installing and managing Hue on HDP cluster☆107Updated 5 years ago
- Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer, with load balancing and rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.☆31Updated 2 years ago
- PostgreSQL and Greenplum Data Source for Apache Spark☆35Updated 8 months ago
- Run TPCH Benchmark on Apache Kylin☆22Updated 2 years ago
- [易车] - Demos for Spark, Flink, HBase, Hive, and Flume that integrate some of Hadoop's native APIs (such as HDFS and MapReduce; currently only these two); also tests some exception cases☆16Updated 5 years ago
- Java version of the Logstash input plugin☆21Updated 5 years ago
- Real-time analytics in Apache Flume☆52Updated 8 years ago
- ☆29Updated 6 years ago