spark-ml / mllib-grid-searchLinks
An example project for doing grid search in MLlib
☆13Updated 10 years ago
Alternatives and similar repositories for mllib-grid-search
Users that are interested in mllib-grid-search are comparing it to the libraries listed below
Sorting:
- Scala client for the Lightning data visualization server (WIP)☆47Updated 5 years ago
- Library for building reproducible data pipelines to support experimentation☆20Updated 9 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Updated 8 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 8 years ago
- Parquet Command-line Tools☆18Updated 8 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 10 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- Benchmarks of artificial neural network library for Spark MLlib☆11Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Updated last year
- ☆24Updated 10 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 9 years ago
- Templates for projects based on top of H2O.☆38Updated 3 months ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- ☆24Updated 9 years ago
- something to help you spark☆65Updated 6 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 9 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Distributed Streaming Quantiles (for PySpark)☆38Updated 11 years ago