jattenberg / datascience-utilities
Data science-oriented utilities: histograms, aggregations, plots, data manipulation, and other common tasks.
☆41 Updated last year
Alternatives and similar repositories for datascience-utilities:
Users who are interested in datascience-utilities are comparing it to the libraries listed below:
- Docker images for data science from Wise.io ☆50 Updated 8 years ago
- PDF and Python files for creating time maps and downloading tweets ☆59 Updated 4 years ago
- Algorithm's team Jupyter Notebooks ☆113 Updated 8 years ago
- Machine Learning with Scikit-Learn (material for PyData Amsterdam 2016) ☆30 Updated 8 years ago
- Advanced git and GitHub course material ☆39 Updated 6 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆20 Updated 7 years ago
- ☆11 Updated 8 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis" ☆65 Updated 7 years ago
- My IPython startup files. ☆109 Updated 10 years ago
- Because you're computing conversion rates wrong ☆16 Updated 7 years ago
- Material for the statistics in Python tutorial ☆43 Updated 9 years ago
- Repo for experiments on pyspark and sklearn ☆79 Updated 10 years ago
- Materials for the IPython/Jupyter workshop at the NGCM Summer Academy, at Southampton University, Boldrewood campus. ☆46 Updated 7 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015 ☆50 Updated 9 years ago
- My talk at Strata 2014 in Santa Clara, CA ☆73 Updated 11 years ago
- Library for GPU-related statistical functions ☆84 Updated 12 years ago
- k-means + a linear model = good results ☆55 Updated 10 years ago
- A book on the applications of topic models. ☆14 Updated 7 years ago
- Material for some talks I have given ☆62 Updated 5 months ago
- A Topic Modeling toolbox ☆92 Updated 8 years ago
- Topic modeling web application ☆40 Updated 9 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle ☆36 Updated 9 years ago
- Benchmarks of the H2O Ensemble R interface (H2O 2.0). ☆14 Updated 4 years ago
- Simple validator for submissions to DrivenData competitions ☆19 Updated 5 years ago
- A bot tweeting nonsensical craft beer reviews via Markov chains ☆49 Updated 9 years ago
- Notebook version of an article on the Fast Forward Labs blog ☆61 Updated 7 years ago
- Articles on Data Science, Jupyter, and Pandas ☆18 Updated 9 years ago
- Code for Pythonic visualization blog post ☆40 Updated 7 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing. ☆140 Updated 12 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016 ☆26 Updated 8 years ago