A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆121Nov 20, 2022Updated 3 years ago
Alternatives and similar repositories for data-science-at-scale
Users that are interested in data-science-at-scale are comparing it to the libraries listed below.
- ☆13Jul 12, 2021Updated 4 years ago
- Introduction to Dask for PyTorch Workflows☆13Mar 3, 2021Updated 5 years ago
- Python implementation of Gibbs sampling for the naïve Bayes model presented by Resnik and Hardisty☆14Feb 10, 2018Updated 8 years ago
- The ecosystem of geospatial machine learning tools in the Pangeo world.☆12Mar 17, 2025Updated last year
- Cubed-Sphere data processing with xarray☆18Jan 16, 2020Updated 6 years ago
- ☆14Jun 2, 2022Updated 3 years ago
- This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.☆14Sep 8, 2017Updated 8 years ago
- An xarray extension to show velocity fields as interactive maps in JupyterLab☆12Dec 2, 2020Updated 5 years ago
- This is a repository of code and datasets for blog posts or articles I've written.☆12Feb 1, 2019Updated 7 years ago
- A Panel app to demonstrate distortions created by non-perceptual colormaps on geophysical data☆12Jan 22, 2026Updated 2 months ago
- ☆53Mar 16, 2026Updated 2 weeks ago
- A small utility for generating ND array pyramids using Xarray and Zarr.☆116Feb 1, 2026Updated last month
- Python interface to TileDB Cloud REST API☆15Mar 13, 2026Updated 2 weeks ago
- ☆115Nov 7, 2022Updated 3 years ago
- ☆12Apr 20, 2021Updated 4 years ago
- JupyterHub deployment for ENGR101 Winter 2018 at Portland Community College☆11Dec 8, 2022Updated 3 years ago
- Deep Learning from Scratch with PyTorch☆121Jul 10, 2020Updated 5 years ago
- A place to provide Coiled feedback☆29Mar 5, 2025Updated last year
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 4 years ago
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- Simple examples of data pipelines from xarray to ML training☆22Dec 19, 2019Updated 6 years ago
- ☆33Aug 14, 2020Updated 5 years ago
- A High-Performance Data Science Toolkit for the Earth Sciences☆71Jun 8, 2024Updated last year
- The course material for the programming course in DEES, University of Manchester☆10Jan 6, 2026Updated 2 months ago
- Central repository for xarray-contrib organization☆11Aug 26, 2022Updated 3 years ago
- Scripts and other artifacts for MODIS data ingestion into Amazon public hosting.☆14Jun 1, 2021Updated 4 years ago
- Example of using AWS for serverless on-demand seismic processing.☆14Mar 30, 2021Updated 5 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- An IPython notebook analysis of the UWC Tampines commercial building dataset☆13Apr 25, 2019Updated 6 years ago
- Easy to use Python library of customized functions for cleaning and analyzing data.☆523Mar 12, 2026Updated 2 weeks ago
- Unmap data from a pseudocolor image, with or without knowing the colormap.☆18Apr 4, 2023Updated 2 years ago
- This repo contains a short version of a dask tutorial.☆12Dec 5, 2022Updated 3 years ago
- Python package to call processed EE objects via the REST API to local data☆36Jun 8, 2024Updated last year
- ☆19Feb 27, 2025Updated last year
- ☆99Mar 20, 2026Updated last week
- Models and parameterizations for the turbulent ocean surface boundary layer in Julia☆25Dec 1, 2022Updated 3 years ago
- ☆43Dec 9, 2025Updated 3 months ago
- ☆21Sep 29, 2021Updated 4 years ago
- Building a Zarr data cube using serverless cloud compute☆56Sep 2, 2025Updated 6 months ago