alexmilowski / data-scienceLinks
Code snippets for data acquisition and organization in data science.
☆22Updated 9 years ago
Alternatives and similar repositories for data-science
Users that are interested in data-science are comparing it to the libraries listed below
Sorting:
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 6 years ago
- Repository for exploratory data transformation & visualization talk☆27Updated 9 years ago
- [development moved to termite-data-server]☆61Updated 11 years ago
- ☆11Updated 10 years ago
- (Deprecated) Task for the Search & Discovery data analyst job.☆21Updated 10 years ago
- Code for Pythonic visualization blog post☆40Updated 8 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 10 years ago
- Stability analysis for topic models☆51Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Code examples and data for the KiwiPyCon 2014 NLP tutorial☆39Updated 11 years ago
- Simple employee cost/benefit model with plots. Supports a series of blog entries.☆70Updated 11 years ago
- Data Server for Topic Models☆122Updated 2 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 9 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 9 years ago
- AXA Driver Telematics Challenge on Kaggle.com☆51Updated 8 years ago
- ☆35Updated 12 years ago
- Analyzing Clickstream Data using Markov Chains and data mining SPACE algorithm☆29Updated 7 years ago
- ☆46Updated 4 months ago
- This repository contains materials for demos, tutorials, and talks by Dato Inc.☆172Updated 9 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 11 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 11 years ago
- the 2nd place solution for West Nile Virus Prediction challenge on Kaggle☆36Updated 10 years ago
- ☆160Updated 8 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 10 years ago
- Tutorial and review of word2vec / doc2vec☆104Updated 10 years ago
- Public Machine Learning and Data Competition Repo☆54Updated 10 years ago
- 12 Week Data Science Immersive☆27Updated 10 years ago
- IPython notebook for PyData SF 2014 tutorial: "Gradient Boosted Regression Trees in scikit-learn"☆63Updated 8 years ago
- Kaggle competition☆23Updated 10 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 8 years ago