unnati-xyz / scalable-data-science-platformLinks
Content for architecting a data science platform for products using Luigi, Spark & Flask.
☆163Updated 5 years ago
Alternatives and similar repositories for scalable-data-science-platform
Users that are interested in scalable-data-science-platform are comparing it to the libraries listed below
Sorting:
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- DePy 2015 Talk☆117Updated 7 years ago
- PyData Seattle 2015: Python Data Bikeshed☆127Updated 9 years ago
- Deep Learning for Pugs☆74Updated 7 years ago
- ☆146Updated 9 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- ☆160Updated 8 years ago
- PyData NYC 2015 conference☆94Updated 9 years ago
- Curated list of all dataset websites that I find☆84Updated 6 years ago
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support☆261Updated 7 years ago
- Presentation at Perth Data Science Meetup, February 2015☆72Updated 10 years ago
- Building Python Data Applications with Blaze and Bokeh Tutorial, SciPy 2015☆144Updated 9 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Code for Learning with Data Blog☆65Updated 8 years ago
- ☆84Updated 7 years ago
- ☆52Updated 8 years ago
- Sparkling Pandas☆363Updated last year
- PyData, The Complete Works of☆299Updated 8 years ago
- All Kaggle competitions☆91Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- ☆263Updated 5 years ago
- ☆34Updated 9 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- ☆190Updated last year
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- My talk at Strata 2014 in Santa Clara, CA☆73Updated 11 years ago
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases.☆171Updated 6 years ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆348Updated 4 years ago