elephantscale / HI-labsLinks
☆46Updated 7 years ago
Alternatives and similar repositories for HI-labs
Users that are interested in HI-labs are comparing it to the libraries listed below
Sorting:
- Coding exercises for Apache Spark☆104Updated 10 years ago
- Sample Spark Code☆91Updated 7 years ago
- ☆76Updated 10 years ago
- Kite SDK Examples☆99Updated 4 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 10 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project…☆30Updated 9 years ago
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume☆39Updated 9 years ago
- Scala: The Unpredicted Lingua Franca for Data Science☆129Updated 7 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆261Updated 2 years ago
- XML Serializer/Deserializer for Apache Hive☆41Updated 6 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status☆35Updated 6 years ago
- Data and example code for Programming Pig, by Alan F. Gates☆187Updated 9 years ago
- Training materials for Strata, AMP Camp, etc☆148Updated 10 years ago
- Simplify getting Zeppelin up and running☆56Updated 9 years ago
- ☆35Updated 9 years ago
- BerkeleyX: CS100.1x, Introduction to Big Data with Apache Spark☆12Updated 10 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆119Updated 9 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Updated 3 years ago
- Reference Architectures for Apache Spark☆38Updated 8 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 9 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- Functional, Typesafe, Declarative Data Pipelines☆139Updated 7 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆55Updated 10 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆167Updated 4 years ago
- Pig on Apache Spark☆82Updated 10 years ago
- These are some code examples☆55Updated 5 years ago