szilard / benchm-ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
☆1,870 Updated 2 years ago
Related projects:
- Anomaly Detection with R ☆3,554 Updated 5 years ago
- A python tutorial on bayesian modeling techniques (PyMC3) ☆2,481 Updated 7 years ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning ☆3,180 Updated 3 years ago
- Tutorial on scikit-learn and IPython for parallel machine learning ☆1,591 Updated 7 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow ☆2,736 Updated 2 years ago
- Sparkling Water provides H2O functionality inside Spark cluster ☆961 Updated last month
- [UNMAINTAINED] Automated machine learning for analytics & production ☆1,641 Updated 3 years ago
- PySpark + Scikit-learn = Sparkit-learn ☆1,152 Updated 3 years ago
- A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python. ☆1,266 Updated 6 years ago
- ggplot port for python ☆3,696 Updated last year
- Distributed Deep learning with Keras & Spark ☆1,573 Updated last year
- A global, black box optimization engine for real world metric optimization. ☆1,305 Updated last year
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F… ☆6,861 Updated this week
- An R package for causal inference in time series ☆1,687 Updated last year
- A collection of tutorials and examples for solving and understanding machine learning and pattern classification tasks ☆4,125 Updated 9 months ago
- Machine Learning Problem Bible | Problem Set Here >> ☆717 Updated 4 years ago
- Tutorials and training material for the H2O Machine Learning Platform ☆1,477 Updated 4 months ago
- Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave ☆1,624 Updated last year
- Forecasting Functions for Time Series and Linear Models ☆1,113 Updated 2 weeks ago
- The probability and statistics cookbook ☆2,238 Updated last year
- Code accompanying the book "Machine Learning for Hackers" ☆3,664 Updated 5 years ago
- TensorFlow for R ☆1,326 Updated 3 months ago
- (Deprecated) Scikit-learn integration package for Apache Spark ☆1,079 Updated 4 years ago
- Open source time series library for Python ☆2,107 Updated 10 months ago
- Machine Learning in R ☆1,640 Updated last month
- dplyr for python ☆763 Updated 7 years ago
- A library for reading text files over multiple cores. ☆1,061 Updated last year
- A probabilistic programming language in TensorFlow. Deep generative models, variational inference. ☆4,830 Updated 6 months ago
- THIS IS THE **OLD** PYMC PROJECT (VERSION 2). PLEASE USE PYMC INSTEAD: ☆879 Updated 4 years ago
- Observations from Ian on successfully delivering data science products ☆543 Updated 3 years ago