BZCareer / hadoop-ansibleLinks
This big data distro contains ansible provisioning for: Apache Hadoop, Apache Spark, Apache Hive, Apache Pig, Apache Storm, Apache Zookeeper, Apache Kafka, Apache Cassandra, ElasticSearch, Kibana, Logstash, Apache Hbase, Apache Zeppelin, Apache Flink
☆14Updated 8 years ago
Alternatives and similar repositories for hadoop-ansible
Users that are interested in hadoop-ansible are comparing it to the libraries listed below
Sorting:
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last year
- an open source dataworks platform☆21Updated 4 years ago
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发;通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆11Updated 6 years ago
- datacollector-oss☆95Updated 10 months ago
- ☆49Updated this week
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆11Updated 2 years ago
- an data-centric integration platform☆48Updated 3 years ago
- Example of using greenplum-spark connector☆19Updated 6 years ago
- Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash☆10Updated 8 years ago
- some useful User Defined Functions(UDF) for both PrestoSQL and TrinoDB☆18Updated 2 years ago
- CDC Kafka Connect source for Oracle Databases leveraging Oracle Logminer☆32Updated this week
- Kettle Web Integrator - An easy and open way to integrate your web app with Kettle Pentaho Data Integration☆50Updated 9 years ago
- 反应式 海量数据治理平台☆41Updated 4 years ago
- ☆62Updated 2 weeks ago
- ☆15Updated 8 years ago
- ☆30Updated 2 years ago
- IoT Trucking App with Flink (with Table API & SQL)☆15Updated 6 years ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co…☆56Updated 6 years ago
- Apache Ambari Web 中文汉化 2.7.x版本直接修改☆39Updated 2 years ago
- Greenplum(v5,v6) exporter for Prometheus☆60Updated last year
- kafka connector 插件,支持输入 mysql binlog 和 json 格式写入ClickHouse 。持续更新☆45Updated 4 years ago
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆32Updated 3 years ago
- 使用shell脚本部署Apache Doris (incubating) FE & BE☆10Updated 5 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆59Updated 8 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Updated 6 years ago
- The nifi of localized support include chinese and japanese .☆30Updated 6 years ago
- DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆26Updated last week
- flink iceberg integration tests, jobs running on yarn.☆38Updated 4 years ago
- 最新源码在 [这里](https://github.com/huzekang/springboot-datax.git)☆34Updated last year
- Dockerfiles and Docker Compose for HDP 2.6 with Blueprints☆23Updated 7 years ago