guozheng / hadoop-completion
Hadoop shell command auto-completion script for Bash Completion
☆12 Updated 10 years ago
Alternatives and similar repositories for hadoop-completion:
Users who are interested in hadoop-completion are comparing it to the repositories listed below:
- ☆23 Updated 7 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS ☆8 Updated 10 years ago
- [DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestr… ☆23 Updated 4 years ago
- An ElasticSearch / Graphite shim which translates graphite requests into ElasticSearch data queries for a given mapping ☆16 Updated 6 years ago
- POC IDS anomaly detection engine built with iPython notebook, matplotlib, pandas, numpy, scikit-learn, d3.js, hyperloglog implementation,… ☆79 Updated 10 years ago
- Apache Spark under Docker ☆9 Updated 8 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015. ☆10 Updated 9 years ago
- High Level Kafka Scanner ☆19 Updated 7 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools. ☆31 Updated 3 years ago
- A JavaScript shell for Elasticsearch ☆105 Updated 9 years ago
- Proposals for new Jupyter subprojects to enter into incubation ☆18 Updated 4 years ago
- Platform documentation ☆16 Updated 8 years ago
- personal cheatsheets on various technologies ☆25 Updated 8 years ago
- ☆15 Updated 3 years ago
- Data Science Command Line Toolbox in a docker container ☆28 Updated 6 years ago
- A real time streaming implementation of markov chain based fraud detection ☆24 Updated 10 years ago
- Security log file challenge ☆28 Updated 8 years ago
- Tools to deploy Hadoop on EMC Isilon ☆18 Updated 8 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm ☆21 Updated 4 years ago
- Python module for creating development environments as Docker containers, similar to virtualenv. ☆17 Updated 8 years ago
- IPython Notebook + D3 ☆128 Updated 10 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala. ☆39 Updated 6 years ago
- Chapter-wise code for Agile Data, the O'Reilly book ☆157 Updated 11 years ago
- A Docker Swarm JupyterHub setup that uses an NFS server running in a Docker container for persistent storage ☆20 Updated 6 years ago
- ☆14 Updated 2 years ago
- framework for making streamcorpus data ☆11 Updated 8 years ago
- A library of modern monitoring tools ☆63 Updated 6 years ago
- Building Python Data Application Tutorials ☆23 Updated 6 months ago
- Spark Tutorial at the University of Maryland ☆38 Updated 10 years ago