sakserv / hadoop-mini-clusters
hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE.
☆296 Updated 3 years ago
Alternatives and similar repositories for hadoop-mini-clusters
Users who are interested in hadoop-mini-clusters are comparing it to the libraries listed below.
- Kerberos and Hadoop: The Madness beyond the Gate ☆280 Updated 2 years ago
- Profiler for large-scale distributed Java applications (Spark, Scalding, MapReduce, Hive, ...) on YARN. ☆128 Updated 7 years ago
- The Internals of Spark Structured Streaming ☆422 Updated 3 weeks ago
- An Open Source unit test framework for Hive queries based on JUnit 4 and 5 ☆262 Updated last year
- Build configuration-driven ETL pipelines on Apache Spark ☆161 Updated 3 years ago
- Mirror of Apache Bahir ☆335 Updated 2 years ago
- Examples of Spark 2.0 ☆212 Updated 4 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies ☆216 Updated 9 years ago
- Write your Spark data to Kafka seamlessly ☆173 Updated last year
- ☆103 Updated 5 years ago
- ☆243 Updated 7 years ago
- Spark connector for SFTP ☆98 Updated 2 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink. ☆550 Updated 4 years ago
- ACID Data Source for Apache Spark based on Hive ACID ☆97 Updated 4 years ago
- Custom state store providers for Apache Spark ☆92 Updated 11 months ago
- Framework for Apache Flink unit tests ☆210 Updated 6 years ago
- Cloudera Manager Extensibility Tools and Documentation. ☆193 Updated 2 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆184 Updated 3 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆241 Updated 10 years ago
- Connect Spark to HBase for reading and writing data with ease ☆295 Updated 8 years ago
- ☆240 Updated 4 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆182 Updated 3 years ago
- Fluent client for interacting with Spark Standalone Mode's Rest API for submitting, killing and monitoring the state of jobs. ☆111 Updated 7 years ago
- High Performance Kafka Connector for Spark Streaming. Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.… ☆634 Updated 3 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments. ☆285 Updated last month
- Spark Structured Streaming / Kafka / Cassandra / Elastic ☆184 Updated 2 years ago
- Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ. ☆447 Updated 4 months ago
- The Internals of Delta Lake ☆187 Updated last month
- SparkOnHBase ☆279 Updated 4 years ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆586 Updated last year