glennmurray / sparkstr
Spark Streaming jobs.
☆11Updated 9 years ago
Related projects: ⓘ
- Spark Parameter Optimization and Tuning☆31Updated 6 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 9 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 8 years ago
- Distributed Matrix Library☆70Updated 7 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Updated last year
- Topic Modeling on Apache Spark☆94Updated 5 years ago
- A Spark port of TFOCS: Templates for First-Order Conic Solvers (cvxr.com/tfocs)☆89Updated 5 months ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 10 months ago
- ☆110Updated 7 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 8 years ago
- An efficient updatable key-value store for Apache Spark☆250Updated 7 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆91Updated 8 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 7 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20Updated 8 years ago
- ☆39Updated this week
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Oracle Data Science Bootcamp 2014☆25Updated 9 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- An implementation of Markov Clustering algorithm for Spark in Scala☆34Updated 7 years ago
- ☆32Updated 4 years ago
- Example of running a Genetic Algorithm (Travelling Salesman) on Apache Spark☆42Updated 7 years ago
- ☆42Updated this week
- An API for Distributed Machine Learning☆154Updated 7 years ago
- A command line tool for Spark packages☆19Updated last year
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- Demonstration code for MLeap, both Jupyter notebooks and projects☆24Updated 5 years ago
- Approximate Nearest Neighbors in Spark☆175Updated 3 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- ☆39Updated this week