Content for architecting a data science platform for products using Luigi, Spark & Flask.
☆162Jan 27, 2020Updated 6 years ago
Alternatives and similar repositories for scalable-data-science-platform
Users that are interested in scalable-data-science-platform are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆52Jul 15, 2016Updated 9 years ago
- Sample repo for luigi tasks & config☆36Jun 5, 2016Updated 9 years ago
- Curated list of all dataset websites that I find☆83Oct 17, 2018Updated 7 years ago
- Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way☆1,454Nov 26, 2017Updated 8 years ago
- Material for some talks I have given☆62Sep 18, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Introduction to Deep Learning for Natural Language Processing☆605May 26, 2020Updated 5 years ago
- Portland Python Meetup March 2015☆40Mar 27, 2015Updated 11 years ago
- Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and conti…☆205Sep 28, 2016Updated 9 years ago
- ☆84Mar 9, 2018Updated 8 years ago
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,080May 22, 2019Updated 6 years ago
- dplyr for python☆760Dec 30, 2016Updated 9 years ago
- Docker container for Shiny Server☆14Mar 31, 2016Updated 10 years ago
- Geographic Data Science with PySAL - Scipy'16☆37Jul 12, 2016Updated 9 years ago
- Create a dashboard with python!☆768Sep 9, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.☆468Feb 27, 2025Updated last year
- ☆190Jul 6, 2023Updated 2 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more☆183Mar 8, 2016Updated 10 years ago
- a web application framework for python☆833Mar 27, 2026Updated 3 weeks ago
- Articles on Data Science, Jupyter, and Pandas☆18Nov 24, 2015Updated 10 years ago
- Getting started with Bokeh - Europython 2015 Talk☆15Jul 24, 2015Updated 10 years ago
- Official page of the userR! 2016 conference☆17Sep 21, 2017Updated 8 years ago
- Introduction to Deep Learning for Image Recognition☆152Jul 11, 2016Updated 9 years ago
- A wrapper around tweepy to produce pandas dataframes for analysis☆74Jul 26, 2016Updated 9 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Multi-GPU reinforcement learning using Deep Q-Network in TensorFlow for OpenAI Gym☆183Jul 15, 2016Updated 9 years ago
- ☆28May 4, 2017Updated 8 years ago
- PyData NYC 2015 conference☆94Nov 11, 2015Updated 10 years ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 8 years ago
- This repository contains research we conduct at Vocapouch we want to share with the world.☆22Jul 13, 2017Updated 8 years ago
- Code and Presentation slides for Teaching the Elephant to Read☆17Apr 19, 2016Updated 10 years ago
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆25Jun 3, 2016Updated 9 years ago
- An extensive machine learning library, made from scratch (Python).☆111Jun 24, 2018Updated 7 years ago
- common data analysis and machine learning tasks using python☆33May 18, 2016Updated 9 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Introduction to data visualization in Python, including plotting data from NetCDF files. Originally created for a Short Course in Data As…☆50Oct 22, 2021Updated 4 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Feb 12, 2016Updated 10 years ago
- Python 2/3 compatible .npz CIFAR-10 dataset☆10Mar 1, 2017Updated 9 years ago
- Bombolone is a tasty Content Management System for Python based on Flask, MongoDB, AngularJS, Sass and Bootstrap. It's designed to be a s…☆75Jul 31, 2015Updated 10 years ago
- <||> Interfaces to Popular R Functions for Data Science Pipelines, and More☆75Sep 9, 2016Updated 9 years ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆241Oct 13, 2018Updated 7 years ago
- Python solver for mixed-effects models☆97Jun 3, 2025Updated 10 months ago