Content for architecting a data science platform for products using Luigi, Spark & Flask.
☆161Jan 27, 2020Updated 6 years ago
Alternatives and similar repositories for scalable-data-science-platform
Users that are interested in scalable-data-science-platform are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆52Jul 15, 2016Updated 9 years ago
- Sample repo for luigi tasks & config☆36Jun 5, 2016Updated 10 years ago
- Curated list of all dataset websites that I find☆84Oct 17, 2018Updated 7 years ago
- Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way☆1,454Nov 26, 2017Updated 8 years ago
- Material for some talks I have given☆62Sep 18, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Introduction to Deep Learning for Natural Language Processing☆603May 26, 2020Updated 6 years ago
- The ultimate twitter streaming data collector☆40Nov 8, 2016Updated 9 years ago
- Portland Python Meetup March 2015☆40Mar 27, 2015Updated 11 years ago
- Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and conti…☆205Sep 28, 2016Updated 9 years ago
- ☆84Mar 9, 2018Updated 8 years ago
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,083May 22, 2019Updated 7 years ago
- dplyr for python☆761Dec 30, 2016Updated 9 years ago
- Docker container for Shiny Server☆14Mar 31, 2016Updated 10 years ago
- Geographic Data Science with PySAL - Scipy'16☆37Jul 12, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Create a dashboard with python!☆767Sep 9, 2019Updated 6 years ago
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.☆467Feb 27, 2025Updated last year
- ☆190Jul 6, 2023Updated 2 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more☆183Mar 8, 2016Updated 10 years ago
- a web application framework for python☆833Mar 27, 2026Updated 2 months ago
- Advance concepts for optimizing pandas, dask and numba☆11Sep 8, 2018Updated 7 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Nov 24, 2015Updated 10 years ago
- Getting started with Bokeh - Europython 2015 Talk☆15Jul 24, 2015Updated 10 years ago
- Introduction to Deep Learning for Image Recognition☆152Jul 11, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A wrapper around tweepy to produce pandas dataframes for analysis☆74Jul 26, 2016Updated 9 years ago
- Multi-GPU reinforcement learning using Deep Q-Network in TensorFlow for OpenAI Gym☆183Jul 15, 2016Updated 9 years ago
- ☆28May 4, 2017Updated 9 years ago
- PyData NYC 2015 conference☆93Nov 11, 2015Updated 10 years ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 8 years ago
- Code and Presentation slides for Teaching the Elephant to Read☆17Apr 19, 2016Updated 10 years ago
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆25Jun 3, 2016Updated 10 years ago
- An extensive machine learning library, made from scratch (Python).☆110Jun 24, 2018Updated 7 years ago
- common data analysis and machine learning tasks using python☆33May 18, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Introduction to data visualization in Python, including plotting data from NetCDF files. Originally created for a Short Course in Data As…☆49Oct 22, 2021Updated 4 years ago
- Python 2/3 compatible .npz CIFAR-10 dataset☆10Mar 1, 2017Updated 9 years ago
- <||> Interfaces to Popular R Functions for Data Science Pipelines, and More☆74Sep 9, 2016Updated 9 years ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆241Oct 13, 2018Updated 7 years ago
- Python solver for mixed-effects models☆97Jun 3, 2025Updated last year
- Tools for exploratory data analysis in Python☆648Aug 5, 2025Updated 10 months ago
- A Jupyter Extension for Adding Pug Photos to Your Notebook☆11Mar 2, 2016Updated 10 years ago