alexmilowski / data-scienceLinks
Code snippets for data acquisition and organization in data science.
☆22Updated 9 years ago
Alternatives and similar repositories for data-science
Users that are interested in data-science are comparing it to the libraries listed below
Sorting:
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 6 years ago
- Some IPython notebooks I've created...☆29Updated 9 years ago
- ☆11Updated 10 years ago
- [development moved to termite-data-server]☆61Updated 11 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 9 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- Data Server for Topic Models☆122Updated 2 years ago
- Repository for exploratory data transformation & visualization talk☆27Updated 9 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 8 years ago
- ☆46Updated 2 months ago
- Kaggle competition☆23Updated 10 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 9 years ago
- Code and Notebooks for the Natural Language Processing with Python course.☆65Updated 7 years ago
- Stability analysis for topic models☆51Updated 8 years ago
- Code for Pythonic visualization blog post☆40Updated 8 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 8 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- IPython notebook for PyData SF 2014 tutorial: "Gradient Boosted Regression Trees in scikit-learn"☆63Updated 8 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Analyzing Clickstream Data using Markov Chains and data mining SPACE algorithm☆29Updated 7 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆71Updated 6 years ago
- field experiments tutorial☆27Updated 11 years ago
- Source code for the "Practical Data Science in Python" tutorial☆58Updated 10 years ago
- the 2nd place solution for West Nile Virus Prediction challenge on Kaggle☆36Updated 10 years ago
- Predicting happiness from demographics and poll answers☆45Updated 8 years ago
- (Deprecated) Task for the Search & Discovery data analyst job.☆21Updated 10 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- Files for London PyData London, 2015☆15Updated 10 years ago
- This project is for the notebooks, code, and data for the "Vocabulary Analysis of Job Descriptions" tutorial at PyData 2017 Seattle☆20Updated 8 years ago