alexmilowski / data-science
Code snippets for data acquisition and organization in data science.
☆22Updated 8 years ago
Alternatives and similar repositories for data-science:
Users that are interested in data-science are comparing it to the libraries listed below
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- [development moved to termite-data-server]☆61Updated 11 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- Repository for exploratory data transformation & visualization talk☆27Updated 8 years ago
- ☆46Updated last month
- Code for Pythonic visualization blog post☆40Updated 7 years ago
- ☆11Updated 9 years ago
- Stability analysis for topic models☆51Updated 8 years ago
- Kaggle competition☆23Updated 9 years ago
- Data Server for Topic Models☆121Updated last year
- A Topic Modeling toolbox☆92Updated 8 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- This project is for the notebooks, code, and data for the "Vocabulary Analysis of Job Descriptions" tutorial at PyData 2017 Seattle☆20Updated 7 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆82Updated 2 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 7 years ago
- Public Kaggle Code and Info☆43Updated 9 years ago
- Some IPython notebooks I've created...☆29Updated 9 years ago
- Studying news events and internal displacement.☆43Updated 7 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- (development moved to new repos)☆116Updated 11 years ago
- 12 Week Data Science Immersive☆27Updated 9 years ago
- (Deprecated) Task for the Search & Discovery data analyst job.☆21Updated 9 years ago
- Files for London PyData London, 2015☆15Updated 9 years ago
- Simple employee cost/benefit model with plots. Supports a series of blog entries.☆70Updated 10 years ago
- repository for code related to the end-to-end data analysis in python workshop, from the Open Data Science Conference 2015☆15Updated 9 years ago
- R code for analyzing tweets relating to #AAA2011 (text mining, topic modelling, network analysis, clustering and sentiment analysis)☆71Updated 11 years ago
- Text Thresher crowd sourced text annotator☆17Updated 7 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 10 years ago
- Tutorial and review of word2vec / doc2vec☆104Updated 9 years ago