bentaylordata / datascience
Data science repo to help others
☆12 Updated 9 years ago
Alternatives and similar repositories for datascience:
Users who are interested in datascience are comparing it to the repositories listed below
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015 ☆50 Updated 9 years ago
- Repo for experiments on pyspark and sklearn ☆79 Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers. ☆30 Updated 9 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016) ☆30 Updated 9 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes. ☆16 Updated 7 years ago
- feng - feature engineering for machine-learning champions ☆27 Updated 8 years ago
- Sample repo for luigi tasks & config ☆36 Updated 8 years ago
- Machine learning evaluation database ☆24 Updated 7 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn. ☆69 Updated 10 years ago
- Example scripts for various deep learning APIs. ☆28 Updated 9 years ago
- Docker container with a PyData stack and JupyterHub server ☆37 Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 Updated 8 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 7 years ago
- Materials for Mike's PyCon Canada 2016 PySpark Tutorial ☆12 Updated 8 years ago
- A Topic Modeling toolbox ☆92 Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆20 Updated 7 years ago
- ETL data pipeline for SixFifty modelling & analytics ☆13 Updated 5 years ago
- Dask powered gridsearch and pipeline a la scikit-learn ☆42 Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- The ultimate twitter streaming data collector ☆40 Updated 8 years ago
- Scripts to Analyze Pronto's Data Release ☆24 Updated 9 years ago
- Healthcare Twitter Analysis ☆26 Updated 8 years ago
- Material for some talks I have given ☆62 Updated 6 months ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation ☆45 Updated 8 years ago
- Proposals for new Jupyter subprojects to enter into incubation ☆18 Updated 4 years ago
- Reinforcement Learning Algorithms ☆14 Updated 6 years ago
- ☆11 Updated 8 years ago
- A couple projects using scikit-learn illustrating project decision making. ☆15 Updated 8 years ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io ☆11 Updated 7 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations ☆55 Updated 12 years ago