netxillon / Hadoop
Hadoop Cluster Configurations
☆32 Updated 3 years ago
Alternatives and similar repositories for Hadoop
Users who are interested in Hadoop are comparing it to the repositories listed below
- ☆54 Updated 10 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop ☆54 Updated 9 years ago
- InputFormat that can split multi-line JSON ☆49 Updated 10 years ago
- Workshops on how to setup security on Hadoop using HDP sandboxes ☆100 Updated 7 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu. ☆50 Updated 8 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation ☆24 Updated 9 years ago
- Sample Spark Streaming application for secure consumption from Kafka ☆33 Updated 7 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:… ☆70 Updated 2 years ago
- Flink Examples ☆39 Updated 9 years ago
- HDF masterclass materials ☆28 Updated 9 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆41 Updated 7 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Spark Streaming HBase Example ☆22 Updated 9 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆61 Updated last year
- ☆49 Updated 5 years ago
- Automated (Ansible) installation of HDP via Ambari Blueprint ☆16 Updated 8 years ago
- Sample UDF and UDAs for Impala. ☆64 Updated 5 years ago
- Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficient… ☆55 Updated 2 years ago
- sample oozie workflows ☆18 Updated 7 years ago
- This tutorial provides a quick introduction to using Spark ☆57 Updated 9 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra ☆62 Updated 5 years ago
- An Ambari Stack service package for VNC Server with the ability to install developer tools like Eclipse/IntelliJ/Maven as well to 'remote… ☆28 Updated 8 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security. ☆67 Updated 5 years ago
- HDP Data Science/Machine Learning demo ☆37 Updated 9 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters ☆83 Updated 6 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet ☆28 Updated 10 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format. ☆34 Updated 11 years ago
- Apache Spark and Apache Kafka integration example ☆124 Updated 7 years ago
- ☆24 Updated 9 years ago
- Visualize your HDFS cluster usage ☆229 Updated 4 years ago