A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
☆121Nov 20, 2022Updated 3 years ago
Alternatives and similar repositories for data-science-at-scale
Users that are interested in data-science-at-scale are comparing it to the libraries listed below.
- Introduction to Dask for PyTorch Workflows☆13Mar 3, 2021Updated 5 years ago
- The ecosystem of geospatial machine learning tools in the Pangeo world.☆12Mar 17, 2025Updated last year
- ☆14Jun 2, 2022Updated 3 years ago
- An xarray extension to show velocity fields as interactive maps in jupyterlab☆12Dec 2, 2020Updated 5 years ago
- A Panel app to demonstrate distortions created by non-perceptual colormaps on geophysical data☆12Jan 22, 2026Updated 2 months ago
- Pangeo Forge public roadmap☆18May 22, 2024Updated last year
- ☆53Apr 4, 2026Updated 2 weeks ago
- nyhackr website written using RMarkdown☆10Updated this week
- A small utility repo for checkerboard sampling☆11Jul 28, 2025Updated 8 months ago
- A small utility for generating ND array pyramids using Xarray and Zarr.☆116Apr 6, 2026Updated last week
- Python interface to TileDB Cloud REST API☆15Mar 13, 2026Updated last month
- ☆115Nov 7, 2022Updated 3 years ago
- JupyterHub deployment for ENGR101 Winter 2018 at Portland Community College☆11Dec 8, 2022Updated 3 years ago
- Deep Learning from Scratch with PyTorch☆121Jul 10, 2020Updated 5 years ago
- A place to provide Coiled feedback☆29Mar 5, 2025Updated last year
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 5 years ago
- Simple examples of data pipelines from xarray to ML training☆22Dec 19, 2019Updated 6 years ago
- A simple R package showcasing how RStudio project templates can be used.☆26Dec 20, 2016Updated 9 years ago
- This repository replicates the figures from the 3rd edition of the book "Recursive Macroeconomic Theory" by Lars Ljungqvist and Thomas J.…☆12Feb 9, 2016Updated 10 years ago
- A High-Performance Data Science Toolkit for the Earth Sciences☆71Jun 8, 2024Updated last year
- 'math+econ+code' masterclass on equilibrium transport and matching models in economics☆36Jun 15, 2023Updated 2 years ago
- The course material for the programming course in DEES, University of Manchester☆10Jan 6, 2026Updated 3 months ago
- Central repository for xarray-contrib organization☆11Aug 26, 2022Updated 3 years ago
- Scripts and other artifacts for MODIS data ingestion into Amazon public hosting.☆14Jun 1, 2021Updated 4 years ago
- Materials for MIT workshop "Practical Computing Tutorials for Earth Scientists"☆39Apr 24, 2020Updated 5 years ago
- Earth System Model Collection specification☆13Feb 3, 2023Updated 3 years ago
- An IPython notebook analysis of the UWC Tampines commercial building dataset☆13Apr 25, 2019Updated 6 years ago
- Easy to use Python library of customized functions for cleaning and analyzing data.☆520Updated this week
- Unmap data from a pseudocolor image, with or without knowing the colormap.☆18Apr 4, 2023Updated 3 years ago
- This repo contains a short version of a dask tutorial.☆12Dec 5, 2022Updated 3 years ago
- Python package to call processed EE objects via the REST API to local data☆36Jun 8, 2024Updated last year
- ☆19Feb 27, 2025Updated last year
- ☆43Sep 13, 2021Updated 4 years ago
- Au Naturel is a LaTeX template built on top of the standard article class, roughly emulating some characteristics of the Nature Publishin…☆10May 2, 2018Updated 7 years ago
- Models and parameterizations for the turbulent ocean surface boundary layer in Julia☆25Dec 1, 2022Updated 3 years ago
- ☆44Dec 9, 2025Updated 4 months ago
- ⛔️ DEPRECATED GPU Ocean Python/CUDA codebase☆11Nov 9, 2023Updated 2 years ago
- ☆21Sep 29, 2021Updated 4 years ago
- Notebooks for Pangeo Showcase Talk on Oct 12, 2022☆14Oct 13, 2022Updated 3 years ago