unnati-xyz / scalable-data-science-platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
☆163Updated 5 years ago
Alternatives and similar repositories for scalable-data-science-platform:
Users that are interested in scalable-data-science-platform are comparing it to the libraries listed below
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- ☆146Updated 8 years ago
- PyData NYC 2015 conference☆94Updated 9 years ago
- Deep Learning for Pugs☆74Updated 7 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- ☆85Updated 6 years ago
- ☆263Updated 5 years ago
- PyData Seattle 2015: Python Data Bikeshed☆127Updated 9 years ago
- DePy 2015 Talk☆117Updated 7 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- Code for Learning with Data Blog☆64Updated 7 years ago
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support☆262Updated 7 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- ☆52Updated 8 years ago
- ☆160Updated 8 years ago
- Curated list of all dataset websites that I find☆84Updated 6 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 7 years ago
- PyData, The Complete Works of☆298Updated 8 years ago
- All Kaggle competitions☆91Updated 8 years ago
- Presentation at Perth Data Science Meetup, February 2015☆72Updated 9 years ago
- Sparkling Pandas☆362Updated last year
- Learn the pyspark API through pictures and simple examples☆169Updated 4 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- Building Python Data Applications with Blaze and Bokeh Tutorial, SciPy 2015☆144Updated 9 years ago
- ☆41Updated 9 years ago
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases.☆172Updated 6 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- Material for some talks I have given☆62Updated 4 months ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 9 years ago