jattenberg / datascience-utilities
datascience oriented utilities: histograms, aggregations, plots, data manipulation, and other common tasks.
☆41Updated last year
Related projects: ⓘ
- Material for some talks I have given☆63Updated this week
- PDF and python files for creating time maps and downloading tweets☆58Updated 4 years ago
- Docker images for data science from Wise.io☆50Updated 8 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆34Updated 5 years ago
- Replication materials for Bayesian measurement error model of dichotomous measures of democracy.☆16Updated 9 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- Python code for Hadley Whickham's article on Tidy Data.☆34Updated 7 years ago
- ☆11Updated 8 years ago
- ☆24Updated 5 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Example scripts for various deep learning APIs.☆28Updated 9 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 8 years ago
- Wiki of links and data science resources started in datascientists.slack.com☆14Updated 9 years ago
- My talk at Strata 2014 in Santa Clara, CA☆74Updated 10 years ago
- build process for turning ipython notebooks into markdown files for your jekyll blog☆18Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- A collection of IPython Notebooks containing my research.☆20Updated 6 years ago
- ETL data pipeline for SixFifty modelling & analytics☆13Updated 4 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- Simulations for my blog post, as well as some helper functions for R users and Python users.☆18Updated 7 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 8 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 10 years ago
- Notebook version of an article on the Fast Forward Labs blog☆61Updated 7 years ago
- ☆41Updated 9 years ago
- Quick & dirty repo for hosting the Notebook for t-SNE presentation at delivered at Python Quants and PyData London meetups☆9Updated 8 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 7 years ago
- Materials for talk on scikit-learn☆27Updated 8 years ago
- Topic modeling web application☆39Updated 9 years ago
- Code for Pythonic visualization blog post☆40Updated 7 years ago
- Code for Sentiment Analysis Symposium tutorial demos.☆15Updated 7 years ago