bbejeck / hadoop-algorithms
☆55Updated 11 years ago
Alternatives and similar repositories for hadoop-algorithms:
Users who are interested in hadoop-algorithms compare it to the libraries listed below:
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Sparse feature extraction with Spark☆30Updated 6 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Getting started with Spark, Spark Streaming, Spark SQL, DataFrame☆36Updated 8 years ago
- Sparking Using Java8☆17Updated 10 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- Backup of papers read☆17Updated 8 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Updated 10 years ago
- ☆11Updated 10 years ago
- Trident-ML: A real-time online machine learning library☆381Updated last year
- Helpful user-defined functions / table-generating functions for Hive☆101Updated 8 years ago
- An Ambari Stack service package for VNC Server with the ability to install developer tools like Eclipse/IntelliJ/Maven as well to 'remote…☆28Updated 8 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
- Online Machine Learning Algorithms☆30Updated last year
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework☆42Updated 12 years ago
- Time series and energy data analysis API for Spark.☆19Updated 12 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆19Updated 11 years ago
- Spark example code demonstrating RDD, DataFrame and DataSet APIs.☆37Updated 9 years ago
- ☆33Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- General Vectorization Lib for Machine Learning Tools☆31Updated 8 years ago
- Apache Spark applications☆70Updated 7 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- ☆9Updated 9 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- Set of Hadoop-, Spark-, and Storm-based tools for web and customer analytics☆34Updated 3 years ago
- ☆13Updated 9 years ago