Content for architecting a data science platform for products using Luigi, Spark & Flask.
☆162Jan 27, 2020Updated 6 years ago
Alternatives and similar repositories for scalable-data-science-platform
Users that are interested in scalable-data-science-platform are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆52Jul 15, 2016Updated 9 years ago
- Sample repo for luigi tasks & config☆36Jun 5, 2016Updated 9 years ago
- Curated list of all dataset websites that I find☆83Oct 17, 2018Updated 7 years ago
- Material for some talks I have given☆62Sep 18, 2024Updated last year
- The ultimate twitter streaming data collector☆40Nov 8, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Portland Python Meetup March 2015☆40Mar 27, 2015Updated 11 years ago
- Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and conti…☆205Sep 28, 2016Updated 9 years ago
- ☆84Mar 9, 2018Updated 8 years ago
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,083May 22, 2019Updated 7 years ago
- Just a boilerplate for PySpark and Flask☆36Aug 2, 2018Updated 7 years ago
- dplyr for python☆761Dec 30, 2016Updated 9 years ago
- Docker container for Shiny Server☆14Mar 31, 2016Updated 10 years ago
- Geographic Data Science with PySAL - Scipy'16☆37Jul 12, 2016Updated 9 years ago
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.☆468Feb 27, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆190Jul 6, 2023Updated 2 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more☆183Mar 8, 2016Updated 10 years ago
- a web application framework for python☆833Mar 27, 2026Updated 2 months ago
- Advance concepts for optimizing pandas, dask and numba☆11Sep 8, 2018Updated 7 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Nov 24, 2015Updated 10 years ago
- Getting started with Bokeh - Europython 2015 Talk☆15Jul 24, 2015Updated 10 years ago
- Official page of the userR! 2016 conference☆17Sep 21, 2017Updated 8 years ago
- Introduction to Deep Learning for Image Recognition☆152Jul 11, 2016Updated 9 years ago
- A wrapper around tweepy to produce pandas dataframes for analysis☆74Jul 26, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-GPU reinforcement learning using Deep Q-Network in TensorFlow for OpenAI Gym☆183Jul 15, 2016Updated 9 years ago
- ☆28May 4, 2017Updated 9 years ago
- PyData NYC 2015 conference☆94Nov 11, 2015Updated 10 years ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 8 years ago
- This repository contains research we conduct at Vocapouch we want to share with the world.☆22Jul 13, 2017Updated 8 years ago
- Code and Presentation slides for Teaching the Elephant to Read☆17Apr 19, 2016Updated 10 years ago
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆25Jun 3, 2016Updated 9 years ago
- An extensive machine learning library, made from scratch (Python).☆111Jun 24, 2018Updated 7 years ago
- Introduction to data visualization in Python, including plotting data from NetCDF files. Originally created for a Short Course in Data As…☆50Oct 22, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Scripts and code written whilst learning and experimenting with machine learning☆13Jul 18, 2022Updated 3 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Feb 12, 2016Updated 10 years ago
- Bombolone is a tasty Content Management System for Python based on Flask, MongoDB, AngularJS, Sass and Bootstrap. It's designed to be a s…☆75Jul 31, 2015Updated 10 years ago
- <||> Interfaces to Popular R Functions for Data Science Pipelines, and More☆74Sep 9, 2016Updated 9 years ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆241Oct 13, 2018Updated 7 years ago
- Python solver for mixed-effects models☆97Jun 3, 2025Updated 11 months ago
- Tools for exploratory data analysis in Python☆649Aug 5, 2025Updated 9 months ago