BZCareer / hadoop-ansibleLinks
This big data distro contains ansible provisioning for: Apache Hadoop, Apache Spark, Apache Hive, Apache Pig, Apache Storm, Apache Zookeeper, Apache Kafka, Apache Cassandra, ElasticSearch, Kibana, Logstash, Apache Hbase, Apache Zeppelin, Apache Flink
☆14Updated 8 years ago
Alternatives and similar repositories for hadoop-ansible
Users that are interested in hadoop-ansible are comparing it to the libraries listed below
Sorting:
- Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-an…☆107Updated 2 months ago
- 反应式 海量数据治理平台☆41Updated 4 years ago
- flink iceberg integration tests, jobs running on yarn.☆38Updated 4 years ago
- flink endpoint for open world☆28Updated last month
- ☆62Updated last month
- an open source dataworks platform☆21Updated 4 years ago
- CDC Kafka Connect source for Oracle Databases leveraging Oracle Logminer☆32Updated this week
- 基于DataX的数据同步任务调度工具,支持自定义定时任务,支持crontab表达式,支持自定义添加DataX数据同步任务☆39Updated 6 years ago
- an data-centric integration platform☆48Updated 3 years ago
- DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆29Updated last week
- datacollector-oss☆96Updated 11 months ago
- Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash☆10Updated 8 years ago
- ☆28Updated 3 years ago
- ☆49Updated this week
- Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to cons…☆80Updated last year
- Visualis is a BI tool for data visualization. It provides financial-grade data visualization capabilities on the basis of data security a…☆263Updated 6 months ago
- Import data from clickhouse to hadoop with pure SQL☆36Updated 6 years ago
- A sample of Flink TiDB Realtime Datawarehouse.☆85Updated 4 years ago
- 此项目主要应用于数据中台或数据平台的数据总线,支持直接实时监听MySQL、MongoDB、PostgreSQL、Oracle、SQL Server、Db2和Cassandra等数据库的数据变更。☆62Updated last year
- Ambari stack service for easily installing and managing Hue on HDP cluster☆107Updated 5 years ago
- 通过Flink的restful API完成job 提交 启动 查询 取消操作☆20Updated 3 years ago
- Example of using greenplum-spark connector☆19Updated 6 years ago
- Bireme is an incremental synchronization tool for the Greenplum / HashData data warehouse☆137Updated 3 years ago
- Kettle plugins for Apache Beam☆41Updated 2 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last week
- Guardian of Waterdrop and Spark☆30Updated 2 years ago
- 一个基于Flink的数据流业务处理平台☆25Updated 2 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Updated 8 years ago
- This repository trackes the code and files for building docker image with Apache Kylin.☆126Updated 3 years ago
- Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)☆22Updated last year