unnati-xyz / scalable-data-science-platformView external linksLinks
Content for architecting a data science platform for products using Luigi, Spark & Flask.
☆163Jan 27, 2020Updated 6 years ago
Alternatives and similar repositories for scalable-data-science-platform
Users that are interested in scalable-data-science-platform are comparing it to the libraries listed below
Sorting:
- Sample repo for luigi tasks & config☆36Jun 5, 2016Updated 9 years ago
- Curated list of all dataset websites that I find☆83Oct 17, 2018Updated 7 years ago
- Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way☆1,468Nov 26, 2017Updated 8 years ago
- Portland Python Meetup March 2015☆40Mar 27, 2015Updated 10 years ago
- Introduction to Deep Learning for Natural Language Processing☆606May 26, 2020Updated 5 years ago
- Create a dashboard with python!☆769Sep 9, 2019Updated 6 years ago
- Introduction to Deep Learning for Image Recognition☆153Jul 11, 2016Updated 9 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Mar 26, 2016Updated 9 years ago
- Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and conti…☆204Sep 28, 2016Updated 9 years ago
- ☆190Jul 6, 2023Updated 2 years ago
- Material for some talks I have given☆61Sep 18, 2024Updated last year
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,077May 22, 2019Updated 6 years ago
- ☆84Mar 9, 2018Updated 7 years ago
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆25Jun 3, 2016Updated 9 years ago
- Bombolone is a tasty Content Management System for Python based on Flask, MongoDB, AngularJS, Sass and Bootstrap. It's designed to be a s…☆75Jul 31, 2015Updated 10 years ago
- dplyr for python☆761Dec 30, 2016Updated 9 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more☆183Mar 8, 2016Updated 9 years ago
- Official page of the userR! 2016 conference☆17Sep 21, 2017Updated 8 years ago
- Docker container for Shiny Server☆14Mar 31, 2016Updated 9 years ago
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.☆467Feb 27, 2025Updated 11 months ago
- Articles on Data Science, Jupyter, and Pandas☆18Nov 24, 2015Updated 10 years ago
- Just a boilerplate for PySpark and Flask☆36Aug 2, 2018Updated 7 years ago
- PyData NYC 2015 conference☆94Nov 11, 2015Updated 10 years ago
- a web application framework for python☆832Mar 12, 2022Updated 3 years ago
- LAAVA: Long-read AAV Analysis☆13Dec 9, 2025Updated 2 months ago
- Dask powered gridsearch and pipeline a la scikit-learn☆42Nov 2, 2015Updated 10 years ago
- ☆35Jan 17, 2015Updated 11 years ago
- Tools for exploratory data analysis in Python☆647Aug 5, 2025Updated 6 months ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 7 years ago
- Principal component analysis plugin for D3.js☆10Aug 20, 2017Updated 8 years ago
- Deploy Dask on Marathon☆10Feb 6, 2017Updated 9 years ago
- Source Code for the event management project at https://eventmanagement.pythonanywhere.com☆12Jun 9, 2017Updated 8 years ago
- Complete software system for real time stock price prediction.☆11May 6, 2017Updated 8 years ago
- a very fast parser for sparse matrix at libsvm format☆10Nov 13, 2017Updated 8 years ago
- A minimal boilerplate for the RESTful services using Flask, SQLAlchemy and Flask-RestPlus (for the swagger-UI).☆14May 1, 2023Updated 2 years ago
- Repo for data surrounding fast food nutrition and ingredients☆10Nov 11, 2018Updated 7 years ago
- Python 2/3 compatible .npz CIFAR-10 dataset☆10Mar 1, 2017Updated 8 years ago
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support☆260Nov 3, 2017Updated 8 years ago
- ☆20Sep 15, 2021Updated 4 years ago