szilard / datascience-latency
Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for the most common analytical tasks (SQL-like data munging, linear and non-linear supervised learning etc.) with the typically available tools on commodity hardware.
☆20Updated 8 years ago
Alternatives and similar repositories for datascience-latency
Users that are interested in datascience-latency are comparing it to the libraries listed below
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Algorithm's team Jupyter Notebooks☆113Updated last month
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 9 years ago
- Benchmarks of the H2O Ensemble R interface (H2O 2.0).☆14Updated 4 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 5 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Updated 10 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- A collection of data science examples implemented across a variety of languages and libraries.☆33Updated 9 years ago
- Example scripts for various deep learning APIs.☆28Updated 9 years ago
- Latent Dirichlet allocation (LDA) for datamicroscopes☆41Updated 9 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- ☆24Updated 9 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015☆50Updated 10 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- ☆28Updated 9 years ago
- Docker images for data science from Wise.io☆50Updated 9 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 11 years ago
- Install directions and example notebooks for Udacity's Deep Learning classes☆28Updated 9 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 9 years ago
- Fast Ensembles of Sparse Trees☆38Updated 9 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Source code for the tutorial series at http://www.thoughtly.co/blog/prototype☆32Updated 10 years ago
- Materials for my PyData Seattle talk☆21Updated 9 years ago
- k-means + a linear model = good results☆55Updated 10 years ago
- Code and data for "The Geometry of Classifiers"☆26Updated 5 years ago
- Code for PyData Talk on "Classifying Products Based on Images and Text using Keras"☆30Updated 8 years ago
- ☆11Updated 8 years ago
- Datasets and notebooks☆13Updated 8 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 13 years ago
- ☆34Updated 9 years ago