pippozq / hadoop-ansibleLinks
Install hadoop cluster with ansible
☆40Updated 7 years ago
Alternatives and similar repositories for hadoop-ansible
Users that are interested in hadoop-ansible are comparing it to the libraries listed below
Sorting:
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last year
- Examples for how to use the Flink Docker images in a variety of ways☆91Updated 3 years ago
- Java client for managing Apache Flink via REST API☆57Updated 5 months ago
- Export Hadoop YARN (resource-manager) metrics in prometheus format☆54Updated 2 months ago
- Flume JSON Interceptor Plugin☆15Updated 3 years ago
- Exports Hadoop HDFS content statistics to Prometheus☆155Updated last week
- Running Presto on k8s☆38Updated 5 years ago
- This is a mirror from https://gerrit.wikimedia.org. See https://www.mediawiki.org/wiki/Developer_access for contributing.☆39Updated 2 years ago
- Elasticsearch-cdc plugin, which supports capture data changes in elasticsearch, and sink the cdc data into kafka.☆40Updated 3 years ago
- Guardian of Waterdrop and Spark☆30Updated 2 years ago
- Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash☆10Updated 8 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- Hadoop FSImage Analyzer (HFSA)☆59Updated 2 weeks ago
- Flink image for Kubernetes that fixes Jobmanage connection issue☆26Updated 6 years ago
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆53Updated 6 months ago
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆31Updated 4 years ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co…☆56Updated 6 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Updated 6 years ago
- Prometheus jmx_exporter configurations for Cloudera Hadoop☆37Updated 7 years ago
- Yarn on Docker - Managing Hadoop Yarn cluster with Docker Swarm.☆37Updated 3 years ago
- Aloha: a distributed task scheduling and management framework☆64Updated 2 years ago
- 使用K8S部署Apache Doris (incubating)(原百度palo)☆12Updated 6 years ago
- DataTunnel 是一个基于spark引擎的超高性 能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆26Updated last week
- ☆20Updated 2 years ago
- Demo showcasing Spark Streaming, Kafka, Kudu - all in Python☆27Updated 8 years ago
- Presto connector for Apache Paimon.☆11Updated 5 months ago
- phoenix☆12Updated 2 years ago
- High-availability and horizontal scalability for InfluxDB☆46Updated 5 years ago
- Instructions for getting started with Ververica Platform on minikube.☆92Updated 5 months ago
- A sample of Flink TiDB Realtime Datawarehouse.☆85Updated 4 years ago