trifacta / floating-elephantsLinks
Docker containers for Hadoop.
☆21Updated 8 years ago
Alternatives and similar repositories for floating-elephants
Users that are interested in floating-elephants are comparing it to the libraries listed below
Sorting:
- Scripts used to setup a Spark cluster on EC2☆388Updated 8 years ago
- Docker build for Apache Spark☆672Updated 4 years ago
- This repository hold the Amazon Elastic MapReduce sample bootstrap actions☆613Updated 2 years ago
- ☆248Updated 6 years ago
- A Spark cluster setup running on Docker containers☆61Updated 6 years ago
- Edge2AI Workshop☆70Updated 7 months ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 9 years ago
- A repository used in a NiFi Registry demo☆13Updated 5 years ago
- Rebooting ggplot2 for scalable big data visualization☆28Updated 8 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 7 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆97Updated 2 weeks ago
- A general purpose framework for automating Cloudera Products☆69Updated 11 months ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated 2 years ago
- Materials for various Hadoop & Nifi related workshops☆19Updated 4 years ago
- Apache Drill Workshop☆19Updated 9 years ago
- Docker build for Zeppelin, a web-based Spark notebook☆221Updated 6 years ago
- customized cloudera-parcel☆13Updated 7 years ago
- A docker image and kubernetes config files to run Airflow on Kubernetes☆655Updated 6 years ago
- A package that allows R developers to use Hadoop HDFS☆64Updated 7 years ago
- XML Serializer/Deserializer for Apache Hive☆41Updated 6 years ago
- ☆32Updated 5 years ago
- Workshops on how to setup security on Hadoop using HDP sandboxes☆100Updated 7 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Tool to generate a Hive schema from a JSON example doc☆227Updated 6 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆263Updated 2 years ago
- My Ph.D. thesis on Outlier Selection and One-Class Classification☆121Updated 2 years ago
- A list of useful Apache NiFi resources, processor bundles and tools☆964Updated 5 years ago
- Kerberos and Hadoop: The Madness beyond the Gate☆282Updated 2 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- CLI tool for syncing a Databricks folder structure with a local git repo.☆17Updated last year